Editorial Note


It gives me immense satisfaction that we could assemble a group of distinguished 
scholars at this Institute to conduct a seminar on the history of mathematics in India, 
leading to the publication of this volume. This whole endeavour owes very much to 
the initiative and commitment of David Mumford. I am very glad that this provided 
also an occasion for having amidst us the renowned indologist Frits Staal. 

It is fitting that this volume is dedicated to the memory of the two outstanding 
scholars David Pingree and K.V. Sarma who have made pioneering contributions 
to this field. 

It is my pleasant duty to thank all those who have contributed to this volume, 
as well as to the one invisible contributor Jayant Shah (Northeastern University, 
Boston) for his critical reading of all the material in this volume. 


C.S. Seshadri 
Director 
Chennai Mathematical Institute 
Preface


It was a great privilege to participate in the seminar Topics in the History of Indian
and Western Mathematics, which took place at the Chennai Mathematical Institute in
January and February 2008. I am a newcomer in this area and cannot read Sanskrit, 
but I had the opportunity to listen and learn from Sanskrit scholars, mostly Indian but 
one Western (Staal) and absorb their various perspectives. It has been very exciting 
to me over the last five years to learn something of the distinctive Indian approach 
to mathematics, from its beginnings in Vedic times to its wonderful achievements 
in algebra and the Calculus, just prior to the waves of Western invasions starting in 
the 16th century.

Today, there is a major resurgence of scholarship in Indian mathematics and 
astronomy both in India and the West, led, on the one hand, by a widespread wave 
of renewed interest in India in Sanskrit studies and on the other hand, by the school 
created by my late colleague David Pingree. Until recently, nearly all articles and 
books on the History of Indian Mathematics and Astronomy were nearly impossible 
to find in the West. But now, new translations and critical editions with commentary 
of many of the extant manuscripts are being published and widely distributed in the 
West as well as in India. 

This volume contains analyses of many of the most important topics in Indian 
Mathematics and Astronomy, taken from talks at this seminar. Let me sketch some 
of the ideas from each chapter, highlighting topics which seemed to me especially 
significant. I will follow a roughly chronological order. 

Dani's chapter deals with the oldest extant Indian mathematical works, the 
Sulbasūtras, or “Rules of the Cord”, which date from as early as c. 800 BCE. These
are manuals on the geometry needed for erecting the fire altars central to Vedic ritual. 
Dani describes in detail their techniques of geometric algebra, their construction 
for a square whose area is the sum of those of two given squares or whose area is 
the same as that of a given rectangle. These anticipate many of the constructions 
which appear later in Book II of Euclid's Elements, with knotted cords replacing 
the Greeks' use of straightedge and compass. In particular, “Pythagoras’s” theorem
is described here well before Pythagoras. (But note that it also occurs as early as 
c.1800 BCE on Babylonian tablets.) Dani then goes on to describe their approximate 
constructions of circles whose area is that of a given square and vice versa. His last
section deals with the rational approximation to the square root of two given
in three of the four Sulbasūtras. Like the Babylonian approximation, it is accurate
to roughly one part in a million. Dani describes a striking geometric method, first 
proposed by Datta, by which this approximation might have been found. But he also 
proposes that, since they were using very long ropes to actually lay out fire altars, it 
is not impossible that they could have carried out a continued fraction-like algorithm
using two ropes and repeatedly taking multiples of the shorter away from the longer.
The idea that such algorithms have a very long history connects to Dutta's article. 
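
The rope procedure suggested above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two "ropes" are modelled as floating-point lengths (the diagonal and side of a unit square), the function names are hypothetical, and the closed form 1 + 1/3 + 1/(3·4) − 1/(3·4·34) is the version of the Sulbasūtra approximation commonly quoted in modern accounts rather than something stated in this preface.

```python
import math
from fractions import Fraction

def rope_quotients(longer, shorter, steps):
    """Repeatedly take whole multiples of the shorter rope away from the longer."""
    quotients = []
    for _ in range(steps):
        q = int(longer // shorter)      # how many times the shorter rope fits
        quotients.append(q)
        # the old shorter rope becomes the longer one; the remainder is the new shorter one
        longer, shorter = shorter, longer - q * shorter
    return quotients

def convergent(quotients):
    """Fold continued-fraction quotients [q0; q1, q2, ...] into one exact fraction."""
    value = Fraction(quotients[-1])
    for q in reversed(quotients[:-1]):
        value = q + 1 / value
    return value

# Diagonal and side of a unit square (floats stand in for physical ropes).
qs = rope_quotients(math.sqrt(2), 1.0, 8)
approx = convergent(qs)

# Commonly quoted form of the Sulbasutra approximation (an assumption, see above).
sulba = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)

print(qs, approx, sulba)
```

Eight rounds of subtraction give the quotients 1, 2, 2, 2, 2, 2, 2, 2, whose convergent is 577/408, the same fraction as the quoted closed form. Whether the authors of the Sulbasūtras actually proceeded this way is, as the text says, only a possibility.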
Staal, the other Westerner besides myself in this seminar, considers the origins
of the number zero and proposes a link of the mathematical zero with the linguistic
zero markers which were invented by the great grammarian Panini. The foundations
of modern computer science as well as linguistics go back to Panini (c. 400 BCE).
Panini invented formal grammar with abstract variables for various parts of an
utterance and recursive rewrite rules. Sanskrit grammar was well-known to all the
mathematician-astronomers and is a plausible source for many ideas which were
later developed in more mathematical ways. Staal focuses on the Sanskrit word
lopa, “something that does not appear”, as a precursor to the idea of zero. As he
has written in other works, Sanskrit grammar appears to grow out of the precisely
formalized Vedic rituals. Astonishingly, the Vedas dealt with possible enactments
of a ritual in which one participant fails to utter some required sentence. The verbal
root lup- of lopa is used to describe this failure: a zero in the ritual. Thus the original
zero could be the priest's lapse.

Raja Sridharan, R. Sridharan and M. D. Srinivas's chapter concerns another 
area of Indian mathematics: the combinatorics which was inspired first by the 
study of Sanskrit prosody and later by the study of musical patterns, both tonal 
and rhythmic. Panini was followed by Pingala (c.300 BCE) who studied Sanskrit 
prosody. Sanskrit was traditionally written in verse and memorized. Each line had 
a characteristic pattern of short and long syllables. Pingala devised a way to order 
all patterns of short and long syllables in a line with n syllables, and, using this, 
(i) to compute the number describing each pattern, (ii) to reconstruct the pattern 
from its number and finally (iii) to compute the number of patterns with a fixed 
number of short, respectively, long syllables. The first and second use the binary 
number system and the third involves calculating the binomial coefficients. This 
beautiful foundation led to much further combinatorial work which the authors
survey. For instance the Fibonacci numbers arose when they asked how many
sequences have a given total length if long syllables are given length 2 and short
length 1. Virahanka discovered the Fibonacci numbers and their recursion relation
in the 7th century (well before Fibonacci). Recursion seems to be a general theme
which runs through much of Indian mathematics. Most of their chapter concerns the
application of these ideas to musical phrases and to musical rhythms, where the
combinatorics becomes more complex and more interesting. This story is beautifully
described by the authors.
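
Pingala's three tasks, and the later duration count, can be illustrated with a short Python sketch. The encoding below is an assumption made for the example: short syllables are written "S" and treated as binary 1, long syllables are written "L" and treated as 0, the first syllable is the least significant digit, and numbering starts from 0. The traditional ordering and numbering conventions may differ in these details, and the function names are hypothetical.

```python
from math import comb

def pattern_to_number(pattern):
    """(i) Map a pattern such as 'LSSL' to its number under the binary encoding."""
    return sum(1 << i for i, syllable in enumerate(pattern) if syllable == "S")

def number_to_pattern(number, n):
    """(ii) Rebuild the n-syllable pattern from its number."""
    return "".join("S" if (number >> i) & 1 else "L" for i in range(n))

def count_with_shorts(n, k):
    """(iii) Number of n-syllable patterns with exactly k short syllables."""
    return comb(n, k)

def count_by_duration(total):
    """Number of patterns of total duration `total` when short = 1 and long = 2."""
    a, b = 1, 1                 # counts for durations 0 and 1
    for _ in range(total):
        a, b = b, a + b         # a pattern ends in a short (1) or a long (2)
    return a

print(pattern_to_number("LSSL"), number_to_pattern(6, 4))
print([count_with_shorts(4, k) for k in range(5)])
print([count_by_duration(t) for t in range(1, 7)])
```

The last line prints the counts 1, 2, 3, 5, 8, 13: each count is the sum of the two before it, which is the recursion relation referred to above.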

My own chapter concerns the introduction of negative numbers in India, in
China and in modern Europe. The full arithmetic of negative numbers
appears in Brahmagupta's Brahma-sphuta-siddhanta in the 7th century and presumably
was known earlier, maybe even in the accounting practices described in
Kautilya's Arthasastra. In contrast, the first place where this is correctly described
in modern Europe is in Wallis's Treatise on Algebra in 1685.
Other Westerners wondered whether $(-1)\cdot(-1)$ might be taken to equal $-1$.
This seems to me a stunning example of how far two mathematical traditions can 
diverge—though converging in the end. 

Dutta's contribution concerns Indian arithmetic and algebra. More specifically, 
it concerns three algorithms. The first, called kuttaka (the pulverizer) constructs, 
for all positive integers a, b, positive integers x, y such that $ax - by$ equals plus
or minus the greatest common divisor c of a and b. The first half of the algorithm
is the same as the Euclidean algorithm, successive subtraction of the less from the
greater, and the second half works backwards to find x and y, just as we do today.
This second step was not taken by the Greeks. Kuttaka appears first in rather cryptic
form in the Aryabhatiya (499 CE) and soon after more explicitly in the works of
Bhaskara I and Brahmagupta. They use it to construct all positive integral solutions
of linear equations $ax - by = d$, with a, b, d positive integers. This had long been
a concern because of the ancient Vedic method for making sense of the relative 
periods of the day, the lunar month, the year and the periods in planetary motion: 
that at the beginning of the present Kaliyuga, all the planets, the sun and moon were 
all lined up in one spectacular conjunction. 
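
The two halves of the kuttaka described above correspond closely to what is now called the extended Euclidean algorithm. The following Python sketch is a modern restatement under that assumption, not a transcription of the Sanskrit procedure. The function name and the final normalisation step, which picks the smallest positive x and the plus sign for the gcd, are choices made for the example.

```python
def kuttaka(a, b):
    """Return (g, x, y) with x, y > 0 and a*x - b*y == g == gcd(a, b)."""
    # Forward pass: the Euclidean algorithm, recording the quotients.
    quotients = []
    r0, r1 = a, b
    while r1:
        quotients.append(r0 // r1)
        r0, r1 = r1, r0 % r1
    g = r0

    # Backward pass: unwind the quotients so that s*a + t*b == g.
    s, t = 1, 0
    for q in reversed(quotients):
        s, t = t, s - q * t

    # Normalise to positive x, y with a*x - b*y == g.
    step_x, step_y = b // g, a // g
    x = s % step_x or step_x            # smallest positive representative
    y = (a * x - g) // b
    if y == 0:                          # happens when a divides b
        x, y = x + step_x, y + step_y
    return g, x, y

g, x, y = kuttaka(137, 60)
print(g, x, y, 137 * x - 60 * y)
```

Once one solution is known, all others follow by adding multiples of b/g to x and of a/g to y, and a solution of $ax - by = d$ exists exactly when g divides d, in which case it is obtained by scaling.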

The second and third algorithms are concerned with “Pell's” equation $x^2 - Dy^2 = 1$,
which quite plausibly arose in the search for good rational approximations $x/y$
to $\sqrt{D}$. The second algorithm, due to Brahmagupta and called the Bhavana, is
equivalent, in modern terms, to the law for multiplication of the algebraic integers
$x + y\sqrt{D}$. As they put it, if $x^2 - Dy^2 = m$ and $u^2 - Dv^2 = n$, then $s = xu + Dyv$,
$t = xv + yu$ solves $s^2 - Dt^2 = mn$. They now played with solutions of the equations
$x^2 - Dy^2 = m$ for various m and several centuries later, they did indeed find
an algorithm which always finds solutions with m = 1, the Cakravala. Thus
they had a complete theory of Pell's equation, modulo one point: a proof that
this worked. It is interesting that the standard proofs in modern texts are
non-constructive whereas the Indian mathematicians focused instead on seeking
constructive methods and never studied non-constructive arguments. Their idea of
mathematics was closer to that of applied mathematicians and computer scientists 
than that of pure mathematicians. Indeed the siddhantas (treatises) where these 
algorithms were written down were manuals for actually calculating astronomical 
events. 
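
The Bhavana rule is easy to check numerically. The Python sketch below composes solutions exactly as in the identity quoted above. The function names and the choice of D = 2 and D = 61 as test cases are assumptions made for illustration, and the Cakravala itself is not implemented here.

```python
def bhavana(D, first, second):
    """Compose (x, y, m) and (u, v, n), where x^2 - D*y^2 = m and u^2 - D*v^2 = n.

    Returns (s, t, m*n) with s^2 - D*t^2 = m*n.
    """
    (x, y, m), (u, v, n) = first, second
    return (x * u + D * y * v, x * v + y * u, m * n)

def norm(D, x, y):
    """The quantity x^2 - D*y^2."""
    return x * x - D * y * y

# D = 2: composing the solution (3, 2) with itself gives better and better
# rational approximations x/y to the square root of 2.
seed = (3, 2, 1)
second = bhavana(2, seed, seed)
third = bhavana(2, second, second)
print(second, third)

# D = 61: an auxiliary solution with m = 3 composes to one with m = 9.
aux = (8, 1, norm(61, 8, 1))
print(bhavana(61, aux, aux))
```

For D = 2 the second composition yields 577/408, which shows how composing a known solution with itself quickly produces close approximations to $\sqrt{D}$.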

Ramasubramanian and Srinivas take up the story of Indian work on Calculus. 
I personally feel this is a story which deserves to be much more widely known 
in the West. Their chapter outlines the millennium-long history of these discoveries,
culminating in the complete analysis of the basic calculus of polynomial and
trigonometric functions, their integrals and derivatives and power series for sine, 
cosine and arctangent—and, of course, applications of these to astronomy. The path 
they took to this is quite disjoint from the path that was taken, first by Archimedes 
and later by Newton and Leibniz. It is extremely unlikely that any of Archimedes’ 
work on, e.g. the Riemann sum for the integral of sine, made its way to India. 
Instead the Indian work seems to have taken off with their discovery of the second 
order finite difference equation for sine in the 5th century CE or earlier (the discrete
analog of the result that sine solves the harmonic equation $y'' + y = 0$). This appears
cryptically in the Aryabhatiya, and clearly soon after, as is well described in Section
5 of Ramasubramanian and Srinivas’s chapter. Also early was an interest in summing
powers of natural numbers, which is described in Section 4. This early work
revolves around finite differences and the corresponding sums, much like Leibniz's
starting point for calculus. But the application to astronomy made it clear by the
10th century that one needed to calculate the “instantaneous velocity” as well as its
finite difference approximation (see Section 6). In the 12th century, Bhaskara II used
these ideas to rediscover Archimedes’ derivation of the formula for the area and 
volume of a sphere. This derivation reduces the problem to computing the integral 
of sine. But, interestingly, as an applied mathematician, he completes the proof by 
numerically summing his sine table—even though he knows quite well that cosine 
differences are sines and could have done it this way. The numerical method was 
apparently more convincing! 
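
The second-order difference equation for sine mentioned above can be written in modern form as sin((n+1)h) − 2 sin(nh) + sin((n−1)h) = −(2 sin(h/2))² sin(nh). The Python sketch below uses it to build a sine table from its first entry alone. It is a modern illustration, not a reconstruction of the historical computation: the step of 3°45′ (24 steps to a right angle) follows the table size usually attributed to Aryabhata in standard accounts, and the exact constant (2 sin(h/2))² is used where the historical rule used an approximate one.

```python
import math

def sine_table_by_differences(steps=24):
    """Build sin(0), sin(h), ..., sin(steps*h) with h = 90 degrees / steps,
    using only the first sine and the second-difference relation."""
    h = (math.pi / 2) / steps
    k = (2 * math.sin(h / 2)) ** 2          # exact constant in the identity
    table = [0.0, math.sin(h)]
    for n in range(1, steps):
        # second difference of the table at n equals -k times the n-th entry
        table.append(2 * table[n] - table[n - 1] - k * table[n])
    return table

table = sine_table_by_differences()
h = (math.pi / 2) / 24
worst = max(abs(table[n] - math.sin(n * h)) for n in range(25))
print(table[-1], worst)
```

The recurrence is the discrete counterpart of $y'' + y = 0$: the second difference of the table is proportional to the table entry itself, with a negative constant.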

The crowning achievements of this work are due to a major genius who is nearly 
unknown in the West: Madhava, who lived in a village in Kerala in the 14th century.
Only a small fragment of his work survives, but, fortunately, his and his school’s 
work was written up in an unusual expository form in the local language, Malayalam, 
by Jyesthadeva in the 16th century, still over a century before Newton and Leibniz
did their work. This book, the Yukti-bhasa, has only now been translated into English
by K. V. Sarma with commentary by the authors of this chapter and M. S. Sriram. The 
first volume was released with some ceremony during our seminar. The work of the 
Kerala school is described in Part 2 of the present chapter. Let me only mention that 
in addition to deriving the power series for sine and cosine and the ‘Gregory’ series

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots,$$

their numerical bent appears again in finding a series of ways to estimate the
remainder so that this becomes a practical tool for calculating $\pi$. This led them to
much more rapidly converging series such as:

$$\frac{\pi}{16} = \frac{1}{1^5 + 4\cdot 1} - \frac{1}{3^5 + 4\cdot 3} + \frac{1}{5^5 + 4\cdot 5} - \cdots$$
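
The gain in speed is easy to see numerically. The Python sketch below compares partial sums of the ‘Gregory’ series π/4 = 1 − 1/3 + 1/5 − ⋯ with those of one rapidly converging series usually attributed to the Kerala school, π/16 = Σ (−1)^k / ((2k+1)^5 + 4(2k+1)). The choice of this particular fast series and the function names are assumptions made for the example.

```python
import math

def gregory_pi(terms):
    """Estimate pi from the first `terms` terms of 1 - 1/3 + 1/5 - ..."""
    return 4 * sum((-1) ** k / (2 * k + 1) for k in range(terms))

def fast_pi(terms):
    """Estimate pi from the first `terms` terms of the series for pi/16."""
    return 16 * sum(
        (-1) ** k / ((2 * k + 1) ** 5 + 4 * (2 * k + 1)) for k in range(terms)
    )

for n in (5, 10, 50):
    print(n, abs(gregory_pi(n) - math.pi), abs(fast_pi(n) - math.pi))
```

The error of the first series shrinks only like 1/n, whereas the terms of the second fall off like the fifth power of n, so a handful of terms already gives several correct decimals.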


Divakaran's article proposes the compelling thesis that recursion is the central
technique that runs through all the Indian work in mathematics. He traces
it back to Panini's grammatical rules and even earlier to Vedic ritual. He points out
that the whole idea of decimal place-value notation can be seen as a way to describe
integers recursively. Recursive generation of larger and larger numbers leads to a
clear conception of the infinitely large and its inverse, to a conception of the infinitely
small and of the limiting process. The bulk of his article, however, focuses on the 
extensive use of recursion in the Kerala work on Calculus. Like Newton, he sees
the introduction of power series as an algebraic analog of decimal expansions ($x^n$
being the analog of $10^n$) and he discusses at length the power series expansion of
$1/(1+x)$ found in the Yukti-bhasa. He then describes the huge step which one finds in the
Yukti-bhasa: recursive proofs based on induction. They derive the integral of $x^n$
using induction on n. Finally, he discusses the derivation of the power series for 
sine and cosine in the Yukti-bhāsā and notes that it is derived by first converting the 
known difference equations for sine in terms of cosine and cosine in terms of sine 
into a summation form. This is the exact analog of converting differential equations 
into integral equations. Jyesthadeva, starting with a crude approximation, then 
recursively back-substitutes each approximation into the summation form to get a 
better one. Passing to the limit, he gets the power series. This back substitution is 
exactly what we do today to solve Fredholm integral equations. 
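
The back-substitution described above can be imitated in a continuous setting with a few lines of Python. This sketch is an analogy under stated assumptions, not Jyesthadeva's own discrete computation: it replaces the summation form by the integral relations sin x = ∫ cos and cos x = 1 − ∫ sin, represents each approximation as a list of exact polynomial coefficients, and starts from the crude guess sin x ≈ x. All function names are hypothetical.

```python
from fractions import Fraction

def integrate(poly):
    """Integrate a polynomial from 0 to x (coefficients listed lowest degree first)."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(poly)]

def refine(sin_poly):
    """One round of back-substitution: cos = 1 - ∫ sin, then sin = ∫ cos."""
    minus_integral = [-c for c in integrate(sin_poly)]
    cos_poly = [Fraction(1) + minus_integral[0]] + minus_integral[1:]
    return integrate(cos_poly), cos_poly

sin_poly = [Fraction(0), Fraction(1)]       # crude first approximation: sin x ≈ x
for _ in range(3):
    sin_poly, cos_poly = refine(sin_poly)

print(sin_poly)
print(cos_poly)
```

Each round adds one more correct term to both series, so after three rounds the sine approximation is x − x³/6 + x⁵/120 − x⁷/5040.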

Finally, M. S. Sriram's Chapter addresses the planetary models in Indian As- 
tronomy. As mentioned above, mathematics was usually studied together with 
astronomy and the two subjects advanced hand in hand. His article concentrates 
on their last model which is due to Nilakantha c.1500 CE, who worked at roughly 
the same time as Copernicus in the West. His model is explained at length in the 
Ganita-yukti-bhasa and is described with modern formulas in Sriram’s chapter. The
most interesting issue here is the historical movement from geocentric models
to heliocentric models. In fact, Nilakantha’s model is ‘essentially’ heliocentric. In
his model, one starts with the mean sun moving in the ecliptic circle. Then for each
planet, one takes a plane through the mean sun, inclined to the ecliptic by a different
amount for each planet and intersecting it at the appropriate nodes. And in each of
these planes one places the corresponding planet on the Ptolemaic approximation to 
the ellipse: an eccentric circle. Once one overcomes the dense tangle of compound 
Sanskrit words, a very modern model shines through. 

These articles together cover a substantial portion of the exciting History of 
Indian Mathematics and Astronomy. Only a fraction of this has become generally 
known to mathematicians in the West. Too many people still think that mathematics 
was born in Greece and more or less slumbered until the Renaissance. Therefore, I 
hope that this book may serve as a way of bringing to the international mathematical 
community a deeper knowledge of the riches in Indian Mathematics. To scholars, 
however, there is another message: there is much work still to be done to study, edit 
and translate the many ancient manuscripts still surviving in libraries all over India. 
One hopes for a deeper and broader picture of the more than two-millennium-long
history when every one of these has been looked at and analyzed. 




David Mumford 
David Pingree, 1933–2005
Kim Plofker 


David Edwin Pingree (2 January 1933 – 11 November 2005) employed the fifty years
of his scholarly career investigating the development of mathematics, astronomy 
and the related exact sciences from ancient Mesopotamia to early modern Europe 
and India. He published editions, translations and studies of source texts in Akka- 
dian cuneiform, Greek, Latin, Sanskrit, Arabic and Persian, on subjects ranging 
from infinite series and interpolation techniques to astral magic and iconography 
in astrological texts. He was professionally affiliated with Harvard University as 
an undergraduate (B.A. in Classics and Sanskrit, 1954), graduate student (Ph.D. 
in Sanskrit and Indian Studies, 1960), and Junior Fellow (to 1963); the Univer- 
sity of Chicago as a faculty member in the Oriental Institute and Departments of 
History, South Asian Languages, and Near Eastern Languages (1963-1971); and 
Brown University as a professor in the Departments of Classics and the History 
of Mathematics (1971–2005). Over the course of his immensely productive career
he received many honors, including a Fulbright Scholarship, a Guggenheim Fel- 
lowship, and membership in several learned societies, among them the American 
Academy of Arts and Sciences, the American Philosophical Society, and the Institute
for Advanced Study in Princeton. Pingree was awarded the title of
“Abhinavavarahamihira” by the government of Uttar Pradesh in 1979, and in 1981
was one of the first recipients of the MacArthur Fellowship (popularly nicknamed 
the “Genius Grant”), together with co-honorees including the philosopher Richard 
Rorty, the paleontologist Stephen Jay Gould, and the computer scientist Stephen 
Wolfram. His total scholarly output comprised several dozen monographs and sev- 
eral hundred research articles, reviews, encyclopedia entries, and other works. 

In his research on the history of science, David Pingree was first and foremost 
what might be termed a "transmissionist": he was primarily interested in what he 
called the “kinematics” of scientific development, the ways that scientific ideas were 
passed from one culture to another, and how they were transformed in the process. 
The first awakening of his interest in the history of science as a field of research was 
due, as he described it, to just such a question of transmission from the medieval 
Indian exact sciences to their Greek counterpart. As a Fulbright scholar in 1955, 
reading a Byzantine Greek manuscript on astrology in the Vatican Library, he noticed
marginal notes by a commentator that included technical terms transliterated from
Sanskrit. This discovery eventually inspired his important studies of the influence 
of Indian astrology on Arabic and Persian texts and consequently on the Greek
astral science tradition of Byzantium. Pingree also explored the mirror image of 
this transmission in an earlier era, tracing the assimilation of Hellenistic horoscopic 
astrology into the Sanskrit discipline of yavana-jataka via the pre-Gupta Indo-Greek 
kingdoms in western India. This work, which became his doctoral dissertation and 
his first monograph, led directly to his prolonged collaboration with the great 
Assyriologist historian of science Otto Neugebauer, and to his study of Akkadian, 
Arabic and Persian in order to track similar exchanges of scientific knowledge 
through the entire four-thousand-year history of the great Eurasian development of 
the exact sciences. (Even erudition as deep as Pingree's was forced to submit to 
some limitations in this quest, however: to the end of his life he remained somewhat 
regretful, and slightly apologetic, that he had never found time to undertake learning 
Chinese.) 

Pingree's hypotheses about cross-cultural transmission of scientific ideas were 
frequently groundbreaking and sometimes controversial. A few of them were ten- 
tative speculations that never attained wide acceptance, such as his suggestion that 
the structure of Bhaskara II's table of easy Sines (laghu-jyā) might reflect influence
from Islamic trigonometry. Some were plausible inferences unproven by conclusive 
textual evidence that were accepted by most mainstream historians but strongly 
resisted by some others, such as his attribution of parts of the Sanskrit
Jyotisa-vedanga/Vedanga-jyotisa to Babylonian mathematical astronomy techniques in the
Achaemenid period or his derivation of Aryabhata’s planetary mean motions around 
500 CE from a hypothesized Greek source. Some were preliminary conclusions 
based on little-known textual sources that remain to be explored more fully, such 
as his deductions about the relationship of Mesopotamian and ancient Indian astral 
omen literature or about the recognition of Islamic optics in seventeenth-century 
Sanskrit astronomy. But most of his discoveries concerning scientific transmission, 
including his study of the Indian adoption of Hellenistic astrology, his exposition of 
the dependence of medieval Spanish astronomy on Indian sources, and his research 
on the reactions to Latin heliocentrism in eighteenth-century Rajasthan, were solidly
established by masterly textual scholarship and have substantially transformed the 
standard narrative of scientific development. 

In light of the above-mentioned controversies, Pingree has sometimes been 
rashly relegated (by those acquainted with only a few isolated fragments of his 
work) to the company of reactionary Orientalists like the nineteenth-century John
Bentley, who took it for granted that original scientific discoveries
must represent mere [...] borrowings and imitations. Pingree himself, however, was far
from sharing this ill-informed view, which he scornfully dismissed as the vice of
“Hellenophilia” unworthy of a serious historian:


Hellenophiles, it might be observed, are overwhelmingly Westerners, displaying
the cultural myopia common in all cultures of the world but, as well, the arrogance
that characterized the medieval Christian's recognition of his own infallibility and
that has now been inherited by our modern priests of science. . . If it is evident that 
for a historian the proposition that the Greeks invented science must be rejected, 
it necessarily follows that they did not discover a unique scientific method. . . 
Babylonian and Indian mathematics are frequently criticized for relying not on 
proofs but on demonstrations. But without axioms and without proofs Indian math- 
ematicians solved indeterminate equations of the second degree and discovered the 
infinite power series for trigonometrical functions centuries before European math- 
ematicians independently reached similar results. . . Those who deny the validity 
of alternative scientific methods must somehow explain how equivalent scientific 
“truths” can be arrived at without Greek methods. And in their denial they clearly 
deprive themselves of an opportunity to understand science more deeply. (Pingree, 
“Hellenophilia versus the history of science”, Isis 83.4 (1992), 554–563)


It should be noted that the "infinite power series" referred to above were familiar 
to Pingree largely through the Sanskrit editions of the Kerala-school treatises pub- 
lished by his brilliant and indefatigable colleague K. V. Sarma. The historiographic 
approach that they, and the studies in the present work, espouse (namely, the careful
exploration of the intellectual content of a scientific tradition within its own cultural
context and in its encounters with other cultures) is what David Pingree recognized
as the true vocation of a historian of science.


Department of Mathematics, Union College, Schenectady NY, USA. 
K. V. Sarma (1919–2005)¹


M. S. Sriram 


Born at Chengannur in Kerala on 27 December 1919, Krishna Venkateswara Sarma
had his school education in Attingal near Thiruvananthapuram. He completed his
B.Sc. degree with Physics as the major subject in 1940, from Maharaja’s College
of Science, Thiruvananthapuram. His family tradition of Sanskrit scholarship influenced
Sarma to join the M.A. course in Sanskrit at Maharaja’s College of Arts,
Thiruvananthapuram, which he completed with distinction in 1942. During 1943–51,
he was in charge of the Manuscripts Section of the Kerala University Oriental
Research Institute and Manuscripts Library. It is here that he acquired expertise in 
deciphering and critically editing palm-leaf and paper manuscripts of Sanskrit and 
Malayalam texts. During this period, he prepared an analytical catalogue of nearly 
50,000 manuscripts of the library. 

From 1951 to 1962, Prof. Sarma was in the Department of Sanskrit, University of 
Madras, where he was associated with the project of compiling the New Catalogus 
Catalogorum of Sanskrit Works and Authors, under the direction of the great San- 
skritist V. Raghavan. It was also the time when his life-long pre-occupation with the 
Kerala school of Astronomy and Mathematics began to take shape and he started 
painstakingly collecting manuscripts on Astronomy, Astrology and Mathematics, 
critically editing and translating many of them. Some of his early publications in this 
genre were Grahacaranibandhana of Haridatta, Siddhantadarpana of Nilakantha, 
Venvaroha of Madhava, Goladipika and Grahanastaka of Parameswara. During 
this period, Prof. Sarma also came under the influence of the renowned scholar T. S. 
Kuppanna Sastri, in collaboration with whom he edited the main text of the Vakya 
system, Vakyakarana, with the commentary of Sundararaja. 

At the invitation of Acharya Viswa Bandhu, Prof. Sarma moved in 1962 to 
the Visvesvaranand Institute of Sanskrit and Indological Studies of the Panjab 
University at Hoshiarpur. He served as the Director of the Institute during 1975–80
and stayed on at the Institute till 1983. This period of his stay at Hoshiarpur
was indeed very productive and he published more than 50 books, mostly on the
Kerala School of Astronomy. These include very important seminal works such as
Drgganita of Paramesvara, Golasara of Nilakantha, A History of the Kerala School
of Hindu Astronomy, Lilavati of Bhaskaracarya with Kriyakramakari of Sankara
and Narayana, Tantrasangraha of Nilakantha with the commentaries Yuktidipika
and Laghuvivrti of Sankara, Jyotirmimamsa of Nilakantha, and Ganitayuktayah.

¹ This is based on the obituary which appeared in Ind. Jour. Hist. of Sci. 41 (2006), 231-246, which
also includes a bibliography of publications of K. V. Sarma.

In 1983, Prof. Sarma returned to South India to settle down in Chennai. His 
important publications during this period include: Indian Astronomy: A Source 
Book jointly with the renowned historian of science B. V. Subbarayappa, Vedanga
Jyotisa of Lagadha and Pancasiddhantika of Varahamihira, on which he had worked
in collaboration with T. S. Kuppanna Sastri. From 1990 onwards, Prof. Sarma
had been working on a critical edition and English translation of the celebrated
Malayalam work Ganita-yukti-bhasa of Jyesthadeva (c. 1530 AD). He requested
K. Ramasubramanian, M. D. Srinivas and M. S. Sriram to prepare detailed explanatory
notes in English. Prof. Sarma also edited a Sanskrit version of Ganita-yukti-bhasa
which appeared in 2004, and compiled an important catalogue, Science
Texts in Sanskrit in the Manuscripts Repositories of Kerala and Tamil Nadu, 
which includes a list of nearly 3,500 works related to science and technology. 
In fact, he continued to be relentlessly active till his very death (on January 13, 
2005). 

Prof. Sarma has to his credit several publications also on diverse aspects of 
Sanskrit learning such as Vedas, Itihasas and Puranas, Dharmasastras, etc. In fact,
he has authored more than 100 books and 500 articles. His outstanding contribution
consists in searching for and bringing to light many of the seminal works of the Kerala
School of Astronomy, which show that the tradition of Mathematics and Astronomy
continued to flourish until the late Middle Ages, at least in the South of India. They also
present a detailed view of the methodology of these sciences, on issues such as 
justification of mathematical and astronomical results and procedures, and the 
importance of continuous examination and revision of planetary theories. It is
mainly due to Prof. Sarma’s painstaking work on primary sources that the work of
the Kerala School has been brought to the attention of historians of Mathematics,
and has provided a new perspective on Indian contributions during the late medieval
period.

Just as in the case of his illustrious predecessors such as Bibhutibhushan Datta, 
Avadhesh Narayan Singh and others, and his own contemporary and collaborator 
K. S. Shukla, Prof. Sarma did not receive, even in his own country, the recognition
and accolades which he richly deserved. He was of course awarded the D.Litt. degree
by Panjab University in 1977 and, in 1992, was bestowed the Certificate of Honour
by the President of India. He was also conferred the honorary degree of Vacaspati
by Kendriya Samskrita Vidyapeetham, Tirupati, in 2003.

Prof. Sarma, with the wish that his legacy should continue, founded the Sree
Sarada Education Society & Research Centre during the 90s, donating his lifetime
savings and invaluable collection of books and manuscripts; since his demise,
the Centre has published critical editions of some of the texts that he had been
working on during his last days. One hopes that the Centre will receive support and 
encouragement from scholars and funding agencies so as to sustain the torch that 
Prof. Sarma lit. 


Department of Theoretical Physics, University of Madras. 
Geometry in the Sulvasūtras


S. G. Dani 


Sulvasūtras are compositions pertaining to the fire rituals performed by the Vedic Indians. The
rituals involved constructions of altars and fireplaces in a variety of shapes, involving geometric
theory. Some of the theory is explicitly enunciated, while some other aspects of the knowledge at
that time can be inferred from the constructions. We present here an overview of the geometric
ideas contained in the Sulvasūtras.


Yajnas, or fire rituals, formed an integral part of life in the Vedic culture, going 
back to 1500 BCE or earlier, and extending until about the sixth century BCE. 
Some of these concerned sacrifices to be performed regularly by a householder 
(grhastha), while performance of certain others was prescribed for bringing about 
fulfilment of specific aims or desires, which included both material (acquiring 
cows, vanquishing an enemy etc.) and transcendental (securing place in heaven) 
aspects. 

In view of the great significance attached to the yajnas, meticulous attention was
paid to a variety of details in their planning and execution. The rituals involved 
construction of altars (vedi) and fireplaces (agni) in a variety of intricate shapes. 
Over a period the procedures for construction of the shapes appear to have got 
more formalised and acquired a degree of sophistication in geometrical terms. The 
Sulvasütras mark the peak in the geometrisation of the altar building activity of the 
Vedic era. 

The Sulvasütras are often referred to as ‘manuals’ for construction of the altars 
and fireplaces. While there is a certain valid analogy here, it should be borne in 
mind however that the contents are not limited to prescribing steps or procedures 
for the construction of the altars and fireplaces. They also describe various geo- 
metric principles involved, and set up a body of geometric ideas. In Baudhayana 
and Apastamba Sulvasütra there are separate sections devoted to geometric 
theory. 

The Vedic people were a heterogeneous community, with many Sakhas 
(branches), having nevertheless a common cultural identity. The different sakhas 
had their versions of Sulvasütras, transmitted orally from generation to genera- 
tion within the community (branch). There are nine extant Sulvasütras of which 
four, the Baudhayana, Apastamba, Manava and Katyayana Sulvasütras, are of significance
from a mathematical point of view. The dates of the Sulvasütras are uncertain,
but it is generally believed that they were composed sometime during the
period 800 to 200 BCE; for the individual Sulvasütras the ranges would be
Baudhayana (800 to 500 BCE), Apastamba and Manava (650 to 300 BCE) and
Katyayana (300 BCE to 400 CE), according to Kashikar, as quoted in [11].

The root sulv means ‘to measure’, and the name Sulvasütra would correspond
to “theory of mensuration” (see [4]). The word sulva also means ‘rope’ in Sanskrit,
which indeed was a major piece of equipment employed in measurements. In the body of
the Sulvasütras the word sulva does not appear; instead the word rajju is used for
rope. On the other hand, while most measurements involved in the constructions
were indeed carried out with ropes, there are instances where a bamboo rod was
used instead. This suggests that the name Sulvasütras indeed was meant to convey
mensuration in the conceptual sense, and not only as operations with the rope. The
meaning of sulva as rope is presumably a later development.

Like other Vedāṅgas (appendages of the Vedas) the Sulvasütras are composed in
the sūtra (aphoristic) style, characterised by short sentences with nouns often com- 
pounded at great length and verbs avoided as much as possible, rather than running 
prose, presumably for reasons of convenience in reciting them. The text, which is in 
prose form in other respects, has been divided by later commentators into convenient 
segments, treated as individual sutras, and grouped into Chapters. As presented in 
[19] (the scheme which we shall follow in the sequel for reference), Baudhayana has 
21 Chapters adding to 285 sütras, Apastamba has 21 Chapters adding to 202 sütras,
Manava has 16 Chapters adding to 228 sütras, and Katyayana has 6 Chapters adding
to 67 sütras.

There is a considerable overlap in the contents of the different Sulvasütras in- 
dicating that the works are expositions from a common stream of knowledge. 
There are also significant differences, which may be attributed to the different 
branches they come from and the difference in their period. For general refer- 
ence in this respect the reader is referred to [19]; (see also [1], [6], [13], [14] 
and [16]). 

The contents of the Sulvasütras can be broadly categorised into two groups, 
one consisting of Geometric theory, and another dealing with various details about 
the constructions of various vedis and agnis; in Baudhayana and Apastamba the 
sections dealing with geometric theory are arranged in the beginning. From the other
part also one can draw some inferences about the geometric knowledge at that 
time. The preponderant aspect in this part however is the description of the “Ar- 
chitecture” of tilings involved in the construction of complex figures needed for 
the vedis and agnis; (Katyayana Sulvasütra however consists mostly of theory 
part - this also explains its being shorter than the others). The designs of some 
of the special fireplaces involve elaborate figures resembling falcons and other 
birds, tortoise, chariot wheels, circular trough (with a handle), pyre, etc., whose 
constructions are described, quite elaborately, in the form of tiling by bricks in
certain primary shapes. Apart from the issue of achieving likeness with the desired
figure, in terms of rectilinear constructions, there are also other stipulations
involved, such as the number of bricks to be used etc. On account of these there
are also some arithmetical and combinatorial features involved, in a rather scattered
form, in the architectural description of the layout of the fire altars. A study of
this aspect would be of interest. We shall however not concern ourselves with it
here, and will confine ourselves to geometry in the Sulvasütras, including the principles and
constructions described explicitly, as well as those which can be seen to be involved 
implicitly. 

The geometric contents from the Sulvasütras will be discussed in the following
sections taking up various themes. Before going over to the main contents, a few
words would be in order regarding the units of measurement involved. They had
various units for measures of lengths. Measures of many of the altars are given
in terms of purusa (meaning man), which was about 7½ feet, stipulated as the
height attained by the performer of the sacrifice, yajamana, with uplifted arms.
A commonly occurring small unit is angula (meaning finger, in width); a purusa
comprised 120 angulas, so an angula was about ¾ of an inch. Pada (meaning
foot, given as 15 angulas in Baudhayana and 12 angulas in Katyayana), pradesa
(12 angulas), aratni (24 angulas) are some of the other length measures that occur
frequently.
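
The relations among these units amount to simple arithmetic, which may be sketched as follows. The sketch takes the purusa to be about 7½ feet (90 inches), so the modern equivalents are only approximate; the names `ANGULAS_PER` and `to_inches` are illustrative choices made here and do not come from the texts.

```python
from fractions import Fraction

# Length units expressed in angulas, as quoted in the discussion above.
# (For pada the Baudhayana value of 15 angulas is used.)
ANGULAS_PER = {
    "angula": Fraction(1),
    "pradesa": Fraction(12),
    "pada": Fraction(15),
    "aratni": Fraction(24),
    "purusa": Fraction(120),
}

# Assumption: one purusa is about 7 1/2 feet, i.e. 90 inches.
INCHES_PER_PURUSA = Fraction(90)
INCHES_PER_ANGULA = INCHES_PER_PURUSA / ANGULAS_PER["purusa"]


def to_inches(value, unit):
    """Convert a length given in one of the units above to approximate inches."""
    return Fraction(value) * ANGULAS_PER[unit] * INCHES_PER_ANGULA


print(INCHES_PER_ANGULA)       # 3/4
print(to_inches(1, "aratni"))  # 18
```

Exact fractions are used so that the ratios between units are not blurred by rounding.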

The Baudhayana Sulvasütra gives in the beginning (sūtra 1.3) names of 18
different units of length measure. The smallest among them, tila (sesame seed),
is 1/34 of an angula. It was postulated by Thibaut (see [20], page 15) that the unit
owes its origin to the fact that they had a formula for $\sqrt{2}$ involving the fraction 1/34
(the formula will be discussed later). Many of the intermediate units do not bear a
simple fractional relation with purusa however; e.g. a bahu is 36 angulas, a yuga
is 86 angulas, etc. The units must have arisen from the context of performance of
specific vedis and many of them occur infrequently.

The other Sulvasütras use many of the units described by Baudhayana but there 
is no systematic listing or a comprehensive statement on their interrelations as 
in Baudhayana Sulvasütra. Some other measures, vitasti, arvasthi, anuka are also 
mentioned in Apastamba Sulvasütra. 

For the area of rectilinear figures they had the notion as we have today. They
were aware that for similar figures the ratio of the areas equals the square of the
ratio of the lengths of the corresponding sides, as is clear from usage of the idea
at various places; in Apastamba Sulvasütra there is also an elucidation of this with
some examples, including with fractional sides 1½ and 2½ (sutras 3.6 to 3.9). The
square units and the corresponding linear unit were known by the same name, the
meaning being understood from the context; e.g. purusa could denote the unit of
length as well as the area of the square with that length (“square purusa”, so to
speak), depending on whether it referred to length or area; we shall also adopt this
as a convention in the sequel when referring to the Sulvasütra units, rather than
prefixing the term “square”.
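
The scaling rule for areas can be illustrated with the fractional sides 1½ and 2½ purusa, which is how the examples in Apastamba are read here. This is a minimal sketch; the function name `area_of_square` is an illustrative choice.

```python
from fractions import Fraction


def area_of_square(side):
    """Area of the square on the given side, in 'square' units of the same name."""
    return Fraction(side) ** 2


# A side of 1 1/2 purusa gives 2 1/4 (square) purusas; 2 1/2 gives 6 1/4.
for side in (Fraction(3, 2), Fraction(5, 2)):
    print(side, "->", area_of_square(side))
# 3/2 -> 9/4
# 5/2 -> 25/4
```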


1. Construction of Rectilinear Figures 


Though inevitably there are some 3-dimensional features to the fireplaces described 
in the Sulvasütras, the geometric ideas of significance chiefly concern planar geom- 
etry. These involve the concepts and construction of rectilinear figures such as 
squares, rectangles, symmetric (isosceles) trapezia and triangles, rhombuses, as 
well as circles, as primary figures. 

The rectilinear figures sought to be drawn had a bilateral symmetry; viz. isosceles 
triangles, symmetric trapezia, rectangles. The east-west line served as the line of 
symmetry. Towards construction of these figures with prescribed sizes for the sides, 
the sutras principally describe steps to draw perpendiculars to the line of symmetry; 
these are however packaged into complete procedures for drawing the desired 
figures, as may be seen in some examples discussed below. The issue of drawing a 
perpendicular to a given line at a given point on it is addressed in the Sulvasütras in
two essentially different ways, involving the following principles (described here
in modern formulation):

• given two circles with equal radii intersecting each other, the line joining their
(two) points of intersection is perpendicular to the line joining their centres,
at the midpoint of the line segment joining the centres.

• (converse of Pythagoras theorem) in a triangle with sides of lengths a, b, c,
if $c^2 = a^2 + b^2$ then the sides with lengths a and b are perpendicular to each
other.


These principles are not enunciated in the Sulvasütras, though they are implicitly 
at work in their constructions (see however the discussion at the end of §2). A large 
number of sütras describe constructions of squares and trapezia via application of 
one of the above, for various specific given sizes, as we shall see in some detail 
below. 

The first statement as above is of course what we commonly use for drawing 
perpendiculars, to a given line at a given point, in Euclidean geometry: take two 
points equidistant from the given point and draw two intersecting circles with these 
as centres and join the points of intersection. This procedure is involved in various 
constructions in Sulvasütras for drawing perpendiculars in the same way as we now 
do. It is however not isolated as a procedure for drawing perpendiculars, but forms a 
part of the package prescribed for construction of various figures (in the framework 
as indicated above): thus a construction of a square in Baudhayana (sutras 1.22 to
1.28, [19]) consists of the following steps, described as an aggregate (see Figure 1):


(i) take a rope of the length of the desired side of the square and mark its midpoint;

(ii) place a pole at the desired midpoint of the square (say at P as in Figure 1a) and,
tying the ends of the rope to it, draw the circle around it by the mark (at the
midpoint), and place poles at the points where the circle meets the east-west line
(A and B as in Figure 1a);

(iii) draw circles with centres at the points A and B by the length of the rope; the
line joining their points of intersection gives the north-south line; mark the
points where this line meets the circle as in (ii) (C and D as in Figure 1b);

(iv) draw circles by the mid-point on the rope, tying both its ends at A and then
at D, and mark the point where they meet (Q as in Figure 1d); this is one of
the vertices of the square, and the others can be obtained similarly.

Figures 1a and 1b: Beginning of Baudhayana construction of a square


Notice that the first three steps are designed to draw the perpendicular bisector to
the line of symmetry and to mark the midpoints of the sides of the desired square,
and in (iv) these are used to produce the vertices of the square.
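
The four steps can be followed in coordinates. The sketch below is a modern reading, not something found in the texts: it places P at the origin with the east-west line along the x-axis, and the helper names `circle_intersections` and `baudhayana_square` are illustrative.

```python
import math


def circle_intersections(c1, r1, c2, r2):
    """Return the two intersection points of two circles (assumed to intersect)."""
    (x1, y1), (x2, y2) = c1, c2
    d = math.hypot(x2 - x1, y2 - y1)
    a = (r1**2 - r2**2 + d**2) / (2 * d)    # distance from c1 to the common chord
    h = math.sqrt(max(r1**2 - a**2, 0.0))   # half the length of the common chord
    mx, my = x1 + a * (x2 - x1) / d, y1 + a * (y2 - y1) / d
    ux, uy = (y2 - y1) / d, -(x2 - x1) / d  # unit vector along the chord
    return (mx + h * ux, my + h * uy), (mx - h * ux, my - h * uy)


def baudhayana_square(side):
    """Vertices of the square of the given side centred at P = (0, 0)."""
    half = side / 2
    # (ii) the circle of radius side/2 about P meets the east-west line at A and B
    A, B = (-half, 0.0), (half, 0.0)
    # (iii) circles of radius `side` about A and B; the line through their
    # intersections is the north-south line, meeting the first circle at C and D
    X1, X2 = circle_intersections(A, side, B, side)
    assert abs(X1[0]) < 1e-9 and abs(X2[0]) < 1e-9  # both lie on the y-axis
    C, D = (0.0, half), (0.0, -half)
    # (iv) circles of radius side/2 about two adjacent midpoints meet at P
    # and at a vertex of the square; keep the point that is not P
    vertices = []
    for m1, m2 in ((A, D), (D, B), (B, C), (C, A)):
        p, q = circle_intersections(m1, half, m2, half)
        vertices.append(p if math.hypot(*p) > math.hypot(*q) else q)
    return vertices


print([(round(x, 6), round(y, 6)) for x, y in baudhayana_square(2.0)])
```

For a side of 2 units the four points returned are the corners at distance 1 from each axis, confirming that steps (i) to (iii) fix the midpoints of the sides and step (iv) the vertices.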

Other constructions are also described in a similar vein, as a package for produc- 
ing the intended figure, without reference to steps involved in each other. 

The procedure for drawing the circles involved in the above construction is by 
tying one end of a rope to a pole placed at the point chosen to be the centre and 
tracking the point at a distance equal to the selected radius; this plays the role of the 
compass as used in school geometry now. 

Figures 1c and 1d: Completion of construction of the square

The same principle as above was also used with a rope in another way (see
Figure 2): given a line and a point on it, say P, to draw the perpendicular to the line
at P, take two points on the line equidistant from P, on either side of P, affix
poles at the two points and tie a rope (loosely) at the two poles; then stretching the
rope holding at its midpoint, points are marked on either side where it lies on the
ground; the line joining these two points is the desired perpendicular to the original
line at P.

It may be seen that though the procedure is different the same principle underlies
this construction. This variation for producing perpendiculars also appears as part
of construction of some rectilinear figures; in particular the Baudhayana construction
of a rectangle given in sutras 1.36 to 1.41 adopts this procedure.
units) for construction of perpendiculars, but this has now been discounted (see [8],
Appendix 5). It is not clear whether there is any other instance historically of using 
converse of Pythagoras theorem for the purpose of drawing perpendiculars. 

Let me now describe in some detail how this application was made, and discuss
the possible convenience for which it was preferred. Take a rope, with endpoints
marked P and Q, with length a (see Figure 3). Let R be the midpoint of PQ (on the
string), S the midpoint of QR and X the midpoint of RS. Now tie the two ends of
the string to two poles placed a distance a/2 apart along the line to which a perpendicular
is to be drawn, with the Q end at the point where the perpendicular is to be drawn.
Let A and B be the points on the plane (ground) where the ends P and Q are tied.
Now stretch the rope, holding it at the point X as above, on one side of the line AB,
and mark the point on the plane where X lies, say C (see Figure 3(c)). Notice that
B and C are at a distance 3a/8 and A and C are at a distance 5a/8. Thus the sides
BC, AB and AC are in the proportion 3 : 4 : 5, and since $3^2 + 4^2 = 5^2$, by the
converse of the Pythagoras theorem the angle ∠ABC is a right angle. We have thus
constructed a perpendicular to the line AB at the point B.
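
The positions of the marks on the rope, and the resulting proportion, can be checked by direct computation. This is a minimal sketch in which positions are measured along the rope from the end P; the function name `nyanchana_345` is an illustrative choice.

```python
from fractions import Fraction


def nyanchana_345(a):
    """For a rope PQ of length a, return the side lengths (BC, AB, AC)."""
    a = Fraction(a)
    P, Q = Fraction(0), a  # positions of the two ends along the rope
    R = (P + Q) / 2        # midpoint of PQ
    S = (Q + R) / 2        # midpoint of QR
    X = (R + S) / 2        # midpoint of RS, the point by which the rope is held
    AB = a / 2             # distance between the two poles
    BC = Q - X             # rope segment between the Q end and X
    AC = X - P             # rope segment between the P end and X
    return BC, AB, AC


BC, AB, AC = nyanchana_345(8)
print(BC, AB, AC)              # 3 4 5
print(BC**2 + AB**2 == AC**2)  # True
```

For a general length a the three values are 3a/8, 4a/8 and 5a/8, so the right angle at B does not depend on the choice of a.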


Figures 3a, 3b and 3c: Construction of perpendicular by the Nyanchana method

A rope with markings as above, which can be preserved, can thus be used as an
instrument to draw a perpendicular, essentially at one stroke; the distance a/2 to
be kept between A and B is also available by a mark on the rope as the distance
between P and R. In directing a yajamana towards drawing a perpendicular (as a
step in the construction of the vedi), it would be simpler for the priest to use this
approach than those with the orthogonality principle discussed above.
The above procedure depends on the fact that a triangle with sides in the proportion
3 : 4 : 5 is a right angled triangle, by the converse of the Pythagoras
theorem. The Sulvasütras describe also analogous procedures using in place of
(3, 4, 5) other triples (a, b, c) such that $a^2 + b^2 = c^2$; typically they are triples of
integers, that we now call Pythagorean triples, and their multiples by a fraction,
but occasionally some incommensurable triples are also involved. As in the above
procedure it involves marking a point X so that when the rope is stretched holding
at that point we would get a right angled triangle. Such a point is called Nyanchana;
Nyanchana means “lying with face downwards” and in this context signifies that
the marked point is to be plotted on the ground. A procedure involving use of the
triple (5, 12, 13) goes as follows: having chosen a distance a between the poles, a
rope of length one and a half times the measure is taken (thus extending the rope by
a/2), and the Nyanchana mark is set at a distance a sixth of the extended piece,
namely a/12, from the joining point. The Nyanchana mark then divides the string
in the proportion 5 : 13, and steps analogous to those described above will yield a
perpendicular at the pole on the side of the shorter segment. It may be noted that the
Pythagorean triple is unrelated to the length a in either case.

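
The arithmetic of the (5, 12, 13) procedure can be followed in the same way. The sketch measures positions along the rope from the end at which the original length a begins; the function name `nyanchana_5_12_13` is an illustrative choice.

```python
from fractions import Fraction


def nyanchana_5_12_13(a):
    """Return (shorter segment, pole distance, longer segment) for pole distance a."""
    a = Fraction(a)
    rope = a + a / 2        # one and a half times the distance between the poles
    mark = a + (a / 2) / 6  # a sixth of the extension, beyond the joining point
    longer = mark           # 13a/12
    shorter = rope - mark   # 5a/12
    return shorter, a, longer


shorter, base, longer = nyanchana_5_12_13(12)
print(shorter, base, longer)               # 5 12 13
print(shorter**2 + base**2 == longer**2)   # True
```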
In [3] I have discussed the theme of Pythagorean triples with regard to the 
Sulvasütras. Here I will therefore introduce it only briefly to put the topic in per- 
spective. 

The main role of the Pythagorean triples in Sulvasütras was their use in producing
perpendiculars via the converse of Pythagoras theorem. The two triples (3, 4, 5) and
(5, 12, 13) are a common occurrence in this respect in the Sulvasütras. These are
primitive triples (there is no common integer factor greater than 1). Some multiples
of these triples (non-primitive) were also commonly in use; the triple (15, 36, 39)
seems to have been an especially familiar one, and perhaps much older than the
Sulvasütras (see [3] for some observations on this). In Apastamba two
[...] symmetric trapezium with a base of 30 units, [...] to the base) of 24 units
(see Figure 4); to [...] 20 meters. [...] based on the Nyanchana [...] (“one-[...]

[Figure 4: labels C, E, D along the face, with ED = 12 units; A, W, B along the base, with WB = 15 units]

[...]
portion of 6 units as the Nyanchana mark. Thus we have a subdivision into 15 and
39 units, and the desired height being 36 units this enables fixing the two vertices A
and B at the base by the Nyanchana method (through triangles EWA and EWB).
The vertices at the face CD are also obtained in the same way, by plotting the mark
at 12 from the end rather than the original Nyanchana mark. The other constructions
involve two cords.

It may be noticed that the diagonals of the trapezium meet at a point M which
together with the base and face makes isosceles triangles AMB and CMD whose
symmetric half parts have lengths given by the triples (15, 20, 25) and (12, 16, 20)
respectively. Apastamba's second procedure consists of drawing the two triangles
using that these are multiples of the triple (3, 4, 5).
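
That the point M produces exactly these triples can be verified from the dimensions of the trapezium (base 30, face 24, height 36). The sketch below uses the modern similar-triangles argument, which is an assumption of this illustration and not a method attributed to Apastamba; the function name is likewise illustrative.

```python
from fractions import Fraction
import math


def mahavedi_diagonal_point(base=30, face=24, height=36):
    """Distances of M, the meeting point of the diagonals, from the base and the face."""
    base, face, height = Fraction(base), Fraction(face), Fraction(height)
    # Triangles AMB and CMD are similar, so M divides the height in the ratio base : face.
    from_base = height * base / (base + face)
    from_face = height - from_base
    return from_base, from_face


from_base, from_face = mahavedi_diagonal_point()
half_base, half_face = 15, 12
print(from_base, from_face)                          # 20 16
print(math.isqrt(half_base**2 + int(from_base)**2))  # 25
print(math.isqrt(half_face**2 + int(from_face)**2))  # 20
```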

The third construction is based on the triple (5, 12, 13) and its multiple 
(15, 36, 39), using that half the base and half the face, viz. 15 and 12, occur in 
these triples; unlike in the first two constructions one of the vertices of the right 
angled triangle involved (the one on the east-west line) is of no significance to 
the diagram itself. It is the last construction in which the triples (8, 15, 17) and 
(12, 35, 37) appear. Again since 15 and 12 occur in these, the vertices at the base 
and the face may be plotted with these triples, respectively, with poles at distances 
of 8 and 35 from the base and face respectively. Manifestly the construction would 
no longer be as elegant as the earlier ones, and Apastamba could not have missed 
noticing that. From the overall context it is clear that the aim has been to quench 
the curiosity about the various ways the Nyanchana method could be used to plot 
the vertices, and no practical requirement is involved. This involves finding triples 
with 15 and 12 as one of the first two entries. Incidentally, Apastamba exhausts
such triples. There is no clue however whether this was known, and if so how it was 
realised. 

In Baudhayana, while only the first two primitive triples, viz. (3, 4, 5) and 
(5, 12, 13), are found used in the constructions, there is a list of 5 primitive 
Pythagorean triples (or rather the first two terms of each, which of course de- 
termines the third term) given, following the statement of Pythagoras theorem. 
Apart from the four triples as in the above discussion the triple (7, 24, 25) also 
forms part of the list; the (non-primitive) triple (15, 36, 39) is also included along 
with the others, which is possibly due to the familiarity with it in a wider context, 
and its association with tradition on account of its being involved in the Mahavedi 
(see [3] for a discussion on this). The wording of the sütra, its location and the 
overall context indicate that the listed triples are given as illustrative examples for 
the Pythagoras theorem.¹
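
The triples named in this discussion can be checked mechanically, both for the Pythagorean relation and for primitivity. This is a small sketch; the list `TRIPLES` simply collects the triples mentioned above, with the third term written out.

```python
from math import gcd

# Triples mentioned in the discussion above (the texts list only the first two terms).
TRIPLES = [(3, 4, 5), (5, 12, 13), (8, 15, 17), (7, 24, 25), (12, 35, 37), (15, 36, 39)]


def is_pythagorean(a, b, c):
    """True if a^2 + b^2 = c^2."""
    return a * a + b * b == c * c


def is_primitive(a, b, c):
    """True if the three terms have no common integer factor greater than 1."""
    return gcd(gcd(a, b), c) == 1


for t in TRIPLES:
    print(t, is_pythagorean(*t), is_primitive(*t))
```

All six satisfy the relation, and only (15, 36, 39), which is three times (5, 12, 13), fails to be primitive.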

There are no other primitive Pythagorean triples found in the Sulvasütras. It
would seem that though they may have had the means of producing more triples, if not
an infinite family, they would have had no motivation for it, with the ones that occur
having specific objectives that are adequately met in their context.

The Nyanchana procedure was also used in a construction with an incommen- 
surable triple (not a multiple of Pythagorean triples); the construction is also in- 
teresting from another point of view (see below). To construct a square of side a 
Apastamba gives the following procedure: on a rope mark three points, two end- 
points and a point in between whose distance from one endpoint is a/2, and from 
the other endpoint it is equal to the length of the diagonal of the square with side 
a/2. The rope is now tied to two poles, one at the desired centre of the square 
and the other at the point at distance a/2 on the east-west line, by the endpoints 
as above, with the longer side (from the in between mark) being attached to the 
centre. The rope is then stretched, holding it at the middle marked point. The point
where it lies on the ground is one of the vertices of the desired square; the other vertices
are plotted similarly. The procedure presupposes being able to mark a point at a
distance equal to the diagonal of a square of side a/2, so the construction is in a way 
"circular", from the point of view of logical development. However, the diagonal 
of a square was such a common occurrence in their practice that it was I suppose 
treated as a “tangible” quantity. Having once constructed a square one could produce 
ropes with markings as required, and then employ them for later construction of 
squares following the above procedure. It has been suggested by some authors (see 
Footnote 4) that an approximate numerical expression for $\sqrt{2}$ was used to get the
diagonal of the square with side a/2; this however seems unlikely, considering the
common usage of ropes all around, which furthermore would give a more accurate
measure, without the cumbersome subdivision that would be involved in producing
the approximate value using the formula for $\sqrt{2}$ (see §6).

¹Indeed, a Pythagorean triple does not, strictly speaking, illustrate Pythagoras theorem, since one
would need to know that the triangle corresponding to the triple as side-lengths is a right angled triangle;
the illustration here would be more like “when you draw rectangles with sides 3 and 4, 5
and 12, ... the diagonals will be 5, 13, ..., and you see that the area produced by the diagonal is the
sum of that of the squares on the sides.”
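
Apastamba's construction of the square with the incommensurable triple, described above, can be checked in coordinates. The sketch assumes the centre of the square at the origin and the second pole at distance a/2 along the east-west line (taken as the x-axis); the function name `apastamba_vertex` is illustrative.

```python
import math


def apastamba_vertex(a):
    """Vertex of the square of side a obtained by stretching the marked rope."""
    half = a / 2
    diag = half * math.sqrt(2)  # diagonal of the square with side a/2
    # Poles at O = (0, 0), the centre, and E = (half, 0). The stretched rope gives
    # a point V = (x, y) with |OV| = diag and |EV| = half; subtracting the two
    # distance equations yields x, and y then follows (taking y > 0).
    x = diag**2 / (2 * half)
    y = math.sqrt(diag**2 - x**2)
    return x, y


print(apastamba_vertex(2.0))  # approximately (1.0, 1.0)
```

The point obtained is at distance a/2 from both axes, that is, a corner of the square; the other corners follow by symmetry.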

For the constructions of the figures the Sulvasütras adopted meticulous proce- 
dures, involving drawing the perpendiculars, which was accomplished by the meth- 
ods that we discussed above. Surprisingly, despite extensive use of these methods, 
conceptualisation of the perpendicular or right angle seems to have eluded them. 
Absence of the concept may have led them to treat the task of drawing each symmetric
trapezium with different given dimensions individually, as is noticeable especially in
Apastamba Sulvasütra. While there is a degree of unity in the descriptions, with the
concept of the perpendicular a more uniform prescription could have been given for
the constructions. Furthermore in several constructions, including in the Mahavedi
as seen above, special Pythagorean triples (not necessarily primitive) were sought
depending on the desired sizes. While the latter may have offered an amusing di- 
version, in practice it would have been simpler to have a unified way of drawing 
perpendiculars, even with Nyanchana method if that was found more convenient, 
and marking the point on the perpendicular line at the desired distance, producing 
the line if necessary. This possibility does seem to have been realised at some stage. 
In [3] I have noted that the vedis described in the Asl. 6.3-6.4 (nirüdhapasubandha 
vedi), Asl. 6.6, Asl. 6.7 (paitrki vedi), Asl. 6.8 (uttara vedi), and Asl. 7.1 involve 
varied shapes (two trapezia of different dimensions, two squares and an oblong rec- 
tangle), but the construction of each of them refers to “Having stretched (the cord) 
by the mark at fifteen" (pancadasikenaivapayamya), and taking various contextual 
factors into account concluded that the phrase is used as a way of saying "Having 
drawn a perpendicular"; the desired point on the perpendicular line is meant to 
be marked on that line by measuring out the requisite distance. This marks a step 
towards conceptualisation of the perpendicular at a practical level, which however 
does not seem to have been abstracted further. 


2. Pythagoras Theorem and its Applications 


The most notable feature of the Sulvasütras in terms of geometric theory is the 
statement of the so called Pythagoras theorem. This stands out especially in the 
context of the fact that some of them, especially Baudhayana, predate Pythagoras. 
There has been a variety of speculation in this respect, including that Pythagoras 
may have got it from the Indians (A. Bürk quoted in [10]) or, in broader terms, 
that there may have been a common source for the geometry of the Greeks and the 
Indians (see Seidenberg [17] and [18]). Available inputs seem inadequate to have a 
meaningful discussion on this, and in any case we will not go into this aspect here. 
The main discussion below will pertain to the role of the theorem in the overall 
context of the Sulvasütras themselves. 
Sulvasütras do not contain proofs, and there are no indications of how the
statements were arrived at; in this respect the analogy with manuals is pertinent. 
The idea of proof in the sense of Euclidean geometry does not really apply to 
the context of the Sulvasütras, since they were essentially concerned only with 
the practical aspects, and at best one can ask how they would have reached the 
individual conclusions, say in terms of practical geometry. While in the case of 
many statements the knowledge may be attributed to geometric intuition, it is 
difficult to see how they may have come to the statement of something like the 
Pythagoras theorem. We will discuss some of the possibilities that have been pro- 
posed in this respect. At the end of the section I propose a new hypothesis on 
how the Pythagoras theorem may have been thought about in the context of the 
Sulvasütras. 

Neither the notion of a right angle nor that of a right angled triangle is found in
the Sulvasütras, as concepts; of course right angled triangles appeared as parts
of various figures, and were implicit in the Nyanchana operations, but were not
identified separately. Thus the statement of the Pythagoras theorem occurs not with
respect to right angled triangles, but rather with reference to rectangles. A close
translation of how it is stated in Baudhayana would be “the diagonal of a rectangle 
makes as much (area) as the (areas) made separately by the base and the side put 
together". An identical statement is found in Apastamba. The statement appears 
also in Katyayana where it is followed by a clause “iti ksetrajnànam", meaning 
"this is the knowledge about areas". In Manava however an equivalent statement 
occurs in quite a different form. A close translation would be “multiply the stretch 
by the stretch and the width by the width; the square root of the sum is how much 
the diagonal is". The difference is striking. While the former presents the statement 
about areas, the latter describes it as an algorithm for computing the length of the 
diagonal. Together with the fact that many geometric constructions described in the 
other Sulvasütras are missing from Manava, this suggests a shift from a geometric 
approach to either an algorithmic or a functional approach. We shall however see later
that Manava also contains some interesting original geometric statements not found
in the other Sulvasütras.

The Pythagoras theorem is also applied in the Sulvasütras towards the following [...]

[...] given squares are of equal size, [...] when [...] given as the diagonal of the square.

Figure 5: Square with area equal to the sum of two given squares.


To describe a square whose area is the difference of areas of two given (unequal)
squares: For this again one marks the side of the smaller square in the larger one
from a corner, forming a rectangle as before. An arc is drawn with one of the
vertices of the rectangle as the centre and the longer side as the radius, meeting the
opposite side of the rectangle. The portion of that side that is cut out, from the vertex
nearer the point picked as the centre, is the answer (see Figure 6; BC as in the
figure is the prescription for the side of the difference of the squares with sides PA
and PB).

Figure 6: Square with area equal to the difference of two given squares.

It may be seen that the cut out portion together with the smaller side of the
rectangle are sides of a rectangle whose diagonal equals the side of the bigger
square, and hence by the Pythagoras theorem the square over it has area equal to
the difference of the two given squares.
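
In coordinates the construction for the difference of two squares reduces to one application of the theorem. The sketch assumes a rectangle with its long side p along the x-axis and short side q along the y-axis; the function name `side_of_difference` is an illustrative choice.

```python
import math


def side_of_difference(p, q):
    """Side of the square whose area is p^2 - q^2, for p > q > 0."""
    # Rectangle with vertices (0, 0), (p, 0), (p, q), (0, q).
    # An arc of radius p centred at (0, 0) meets the opposite long side y = q
    # at the point (c, q); the portion cut out from the vertex (0, q) has length c.
    c = math.sqrt(p * p - q * q)
    return c


print(side_of_difference(5, 3))   # 4.0
print(side_of_difference(13, 5))  # 12.0
```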

The summing or augmentation of squares with other squares was involved in
producing figures with the same shape as a given figure but with a multiple of its area. For
this the procedure in the construction of the figure would be followed as before,
with an enlarged unit. To double the area one would replace the unit by the
diagonal of the unit square; this was referred to as dvikarani. For tripling the area one
would replace it by the side of the square obtained by augmenting a square with
area two with the unit square (trikarani), and so on.
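
The successive enlarged units can be generated by repeated augmentation with the unit square. The sketch assumes, in line with the statement of the theorem for rectangles, that the side of the sum of two squares is taken as the diagonal of the rectangle formed by their sides; the function name `karani` is used here only as a convenient label.

```python
import math


def karani(n):
    """Side of the square with n times the area of the unit square (n >= 1)."""
    side = 1.0
    for _ in range(n - 1):
        # augment the current square with the unit square: the new side is the
        # diagonal of the rectangle with sides `side` and 1
        side = math.hypot(side, 1.0)
    return side


print(karani(2))  # dvikarani, approximately 1.41421
print(karani(3))  # trikarani, approximately 1.73205
```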

The square with area equal to the difference of two squares does not seem to have 
had any direct application in the construction of altars or fireplaces. It was however 
involved in meeting the following geometric objective, which in turn was needed 
in practice as we shall see below. 


To transform a rectangle into a square with the same area²: For this purpose, forming
the square over the smaller side, the remaining rectangle is divided into two equal
parts parallel to the square and the farther part is then taken off and put adjacent to
the other side of the square (see Figure 7).

[...] into a gnomon, viz. a difference of two squares (see Figure 7). How to transform
this into a square is already described; this is how they put it also (!); tasya nirhara
uktah, “resolution of this has been explained”, so states Baudhayana.
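
The transformation of a rectangle into a square can be traced numerically. The sketch follows the reading given above, in which the rearranged rectangle is a gnomon, namely a larger square with a smaller corner square removed; the function name `square_from_rectangle` is illustrative.

```python
import math


def square_from_rectangle(a, b):
    """Side of the square with the same area as the rectangle with sides a and b."""
    long_side, short_side = max(a, b), min(a, b)
    strip = (long_side - short_side) / 2  # width of each half of the leftover rectangle
    outer = short_side + strip            # side of the larger square of the gnomon
    inner = strip                         # side of the missing corner square
    # the difference of the two squares gives the required square
    return math.sqrt(outer**2 - inner**2)


print(square_from_rectangle(9, 4))  # 6.0
```

Since outer is (a + b)/2 and inner is (a − b)/2, the difference of their squares is ab, which is the area of the original rectangle.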

As rectangles could thus be transformed into squares one could in turn augment 
a square with a given rectangle. This was involved in the following. Various yajnas 
were to be repeated periodically and for each successive performance, the area of 
the fireplace (agni) had to be enhanced, while maintaining its shape. In many cases 
the first fire altar had area 7½ purusas, followed by 8½, 9½, and so on, and it could go 
up to 101½ purusas³. To construct a fire altar with an enhanced area, one would carry 
out the same procedure as before with the original unit replaced by an appropriate 
larger one. This would apply independently of the (often intricate) shape of the fire 
altar. To increase the area from 7½ purusas to 8½ purusas, for instance, the unit 
would have to be increased so that $\frac{2}{15}$th purusa area would be added to each purusa 
in forming the new unit. The latter could thus be obtained as the side of the square 
combining a square purusa with a rectangle of area $\frac{2}{15}$th purusa. 
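
As a worked instance of this rescaling, the following sketch computes the extra area per square purusa and the resulting unit for the step from 7½ to 8½ purusas. The variable names are assumptions made for this illustration; the square root stands in for the geometric construction of the side.

```python
from fractions import Fraction
import math

old_area = Fraction(15, 2)   # 7 1/2 purusas
new_area = Fraction(17, 2)   # 8 1/2 purusas

# Extra area to be added to each square purusa of the old altar.
extra_per_unit = (new_area - old_area) / old_area
# Side of the square combining a unit square with that extra area.
new_unit = math.sqrt(1 + extra_per_unit)

print(extra_per_unit)
print(new_unit)
```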

It may be recalled that the converse of the Pythagoras theorem is also a simple 
consequence of the theorem itself. Use of the converse in the constructions as seen in 
the last section may also be viewed as an application of the Pythagoras theorem. 
Actually it would seem that the sutrakaras viewed the converse as a part and parcel 
of the theorem; clubbing the converse with the direct statement is not uncommon 
in the course of intuitive thinking. Otherwise it is difficult to explain why they 
would make it a point to state the direct theorem (as is done in all the Sulvasütras) 
but not the converse that is much more used. There are some other points which 
seem to corroborate this. Following the statement of the Pythagoras theorem in 
Apastamba Sulvasütra there is a clause tabhirjneyabhiruktam viharanam. This 
translates as “the above mentioned constructions are knowable (deducible) from 
this". The constructions referred to are of squares by the Nyanchana method, using 
the triples (5, 12, 13) and (3, 4, 5) respectively, which involves the converse of the 
Pythagoras theorem. Thus the converse seems to be viewed as an integral part of 
the theorem itself, though not stated as such. Similarly, as mentioned earlier, in 
Baudhayana the statement of the Pythagoras theorem is followed by a list of the 
Pythagorean triples, and there is a connecting clause ityetasüpalabdhih, translating 
as "this is obtained in". As the Pythagorean triples are more closely connected with 
the converse theorem this would also suggest that the converse was clubbed together 
with the theorem itself. 

Let me now come to the question of how the validity of the Pythagoras theorem 
may have been concluded. As there are some Pythagorean triples occurring in the 
Sulvasütras it has been suggested that observation in a few cases was extrapolated 
into a general statement (see [21]). This explanation overlooks the point that knowing 
a Pythagorean triple is not tantamount to knowing a right angled triangle (or 
rectangle); a Pythagorean triple would be of significance only after one has arrived 
at the Pythagoras theorem, and not lead to it. Datta [4] (see also [19]) has proposed 
possible geometrical arguments they may have known. The connection of the 
construction involved however seems too distant from the Sulvasütras constructions 
for the argument to be convincing. 


³This is according to [19] and [11]. It however seems to this author that the original sutra translates 
as going up to 108½ purusas, adding one purusa 101 times; this of course has no consequence to any 
mathematical discussion. 

The following possibility does not seem to have been considered in the literature. 
From the context of the Sulvasütras we see that they faced the question as to how 
the squares add up (rather than the question the other way, as to what is the size of 
the diagonal, or the square over it). Augmentation of squares, for the purpose of 
working out the size of the enhanced unit as explained earlier, was in all likelihood 
the motive behind the discovery of the theorem, as has been noted already by 
Seidenberg [18]. If one were to contemplate the question in the old times prior to 
the theorem being discovered, with the conviction of a mystic that there has to be a 
natural choice, it seems natural that the diagonal of the rectangle with the two sides 
of the two squares (perhaps the one emanating from the common point when the 
squares are set on the two sides of the rectangle) would pose as a natural candidate to 
give the same area⁴; what else can it be? Experience with square tilings would also 
aid such an intuition. Having guessed the answer it could be confirmed in various 
ways; it does not even have to be only with triangles with rational sides, though the 
latter would indeed facilitate verification in some ways. 


3. Transforming a Square into a Circle 


Towards the end of the purely geometric part Baudhayana gives (in sutra 1.58) a 
geometric construction to produce a circle with the same area as that of a given square. 
Essentially the same procedure is described in the other three Sulvasütras, though 
the wording in each of them varies a little, including in usage of the geometric terms. 

Finding a circle with a given area, for which the above procedure would be 
used, was involved in the construction of fire altars in the shape of a chariot wheel 
(rathacakraciti), a circular trough (with a handle) (dronaciti), and a rounded tortoise 
(parimandala kürmaciti). 

Given a square, the prescription for obtaining the desired circle goes as follows. 
Take half the diagonal of the square and drop it from the centre along the midriff, and 
draw the circle including a third of the part jutting out (see Figure 8; QR is $\frac{1}{3}$rd of 
QS, and PR is the prescription for the radius of the desired circle). For a square with 
side $2a$ the prescribed radius works out to be $a + \frac{1}{3}(\sqrt{2} - 1)a = (2 + \sqrt{2})a/3$. 


⁴This may be compared with realising that the resultant of two forces at a point represented by two 
vectors is given by the vector represented by the diagonal of the parallelogram formed by the two. 

Figure 8: Circling the square. 


The area of this circle for the unit square works out to be 1.0172524 . . . , slightly 
more than that of the original square.⁵ 
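
The figure quoted for the area can be reproduced directly from the prescription. The following is a minimal sketch, with variable names chosen here for illustration; it also prints the reciprocal of the squared radius, which is the ratio that would make the area exactly 1.

```python
import math

side = 1.0                               # unit square
half_side = side / 2
half_diagonal = side * math.sqrt(2) / 2
# Radius: half the side plus one third of the part of the half diagonal
# that juts out beyond the side.
radius = half_side + (half_diagonal - half_side) / 3
area = math.pi * radius**2

print(area)
print(1 / radius**2)   # ratio in place of pi that would give area exactly 1
```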

In Baudhayana there is no further comment on the sütra. On the other hand a 
sutra in Apastamba (see the next paragraph) says in particular "as much is left out 
that much comes in", the reference being to the portions of the square that are left 
out when the circle is drawn and those which are incorporated inside. The comment 
gives an insight into how the choice for the radius of the circle was arrived at. 
Observing that the diameter should be between the length of the side and that of 
the diagonal, they looked for a proportion of the extra part which would ensure that 
as much area is left out that much comes in. The correct proportion for this (in the 
light of modern knowledge) turns out to be 0.30993473 ...., and if it is to be 
approached by a simple fraction, as one of a certain number of parts, then it would 
have to be one in three. That is the number they picked, presumably from intuitive 
considerations. Incidentally, if they had chosen the proportion to be 3 out of 10 (not 
inconceivable in the context of the number 10 having acquired significance as a 
base in counting numbers) they would have got a better result; for the unit square 
they would have got a circle with area 0.99271948 . . . . 
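
The proportions discussed here can be compared in a few lines. This is a sketch under the assumption of a unit square; the helper `circle_area` and the variable names are introduced only for this illustration.

```python
import math

half_side = 0.5
jut = math.sqrt(2) / 2 - half_side   # half diagonal minus half side (unit square)

def circle_area(proportion: float) -> float:
    """Area of the circle whose radius takes in `proportion` of the jutting part."""
    return math.pi * (half_side + proportion * jut) ** 2

# Proportion for which the circle has exactly the area of the unit square.
exact = (1 / math.sqrt(math.pi) - half_side) / jut

print(exact)
print(circle_area(1 / 3))    # the prescription in the Sulvasutras
print(circle_area(3 / 10))   # the hypothetical alternative of 3 parts in 10
```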

⁵In the literature there is an inclination to talk of something like “what value of $\pi$ this amounts to”, 
if the area of the circle is to be 1 as expected (it turns out to be 3.0883118 . . .). I find this peculiar; there 
was no concept of $\pi$ in the Sulvasütras, and they were not computing such a ratio. The comparisons 
should be in terms of the output produced, as followed above. 

The sutra from Apastamba relating to the area of the circle is followed by 
sanityamandalam and then yavaddhi yate tavadagantu. The second part means “as 
much is left out that much comes in”, which was quoted above. There has been 
a debate in the literature on what the first part stands for. One of the commentators, 
Karavindasvami, interprets the lines as “The circle is exactly as large as the 
square, for as much the circle falls short, so much comes in". Another commentator 
Kapardisvami breaks up sanityá as sā anityà and asserts that the circle construction 
is an approximate one. While linguistically both the interpretations are possible 
for the first part it seems certain that Apastamba meant to state that the circle is 
exactly as large, and without that the following part would be lacking in context. 
It does not seem meaningful to say “this is an approximate construction; as much 
is left out so much comes in". The alternative interpretation is motivated by the 
call, after a passage of time, to interpret it in a way consistent with the knowledge 
that it is in fact an approximate construction; this point has been made in [20]. 
The fact that it is only approximate would I believe have been suspected in the 
times of the Sulvasütras. The construction was applied in building fire altars of the 
size of 7½ purusas (approximately 45 square meters), to be tiled into 200 tiles. An 
error of close to 2% would have been noticeable in such a context to some of those 
directly involved with the altar building. On the other hand, the error is not big 
enough to make a definitive case about it. As I see it, Apastamba's statement is 
aimed at putting forth forcefully, in the face of doubts (murmurs?) of sceptics, the 
scholastic position of the time: that is the exact circle; as much is left out so much 
comes in. In course of time the suspicions would have gained weight, serving as 
motivation to look for an alternative construction, and one was indeed found by the 
time of Manava Sulvasütra, which we discuss below. In Katyayana however only 
the original construction is described; even though it is a later Sulvasütra, being 
in a different stream it was perhaps untouched by the development around that 
sutra. 

Manava describes another construction for the circle with the area of the given 
square (cf. sutra 11.15 in [19] and 10.3.2.15 in [11]). While it actually happens to 
be more accurate as we shall see below, the statement is very unclear, as a result 
of which it does not seem to have been understood properly; in particular I believe 
the comment on it in [19]⁶ is unwarranted, and the algebraic presentation in [11] is 
incorrect. Let me first present my interpretation of what the sütra means, and then 
discuss the wording in the sütra and the lacunae making it unclear, that apparently 
led to the confusion over it. 

⁶The comment is: “Possibly these are not problems of quadrature of the circle. Ordinary squares are 
drawn without any mathematical significance.” 

Given a square, draw a line parallel to one of the sides and dividing the square 
into one-third and two-third parts (see Figure 9). Extend the line to meet the circle 
through the vertices of the square. Now consider one of the segments of the line 
from the square to the circle and take the point at one-fifth of its length, from 
the point on the square. Draw the circle passing through this (with centre at the 
midpoint of the square); in Figure 9 QR is $\frac{1}{5}$th of QS and PR is the prescription 
for the radius of the desired circle. For a square of side length 2 the length of the 
segment between the square and the circle is seen to be $\left(\frac{\sqrt{17}}{3} - 1\right)$, so the square 
of the radius of the circle is $\left\{1 + \frac{1}{5}\left(\frac{\sqrt{17}}{3} - 1\right)\right\}^2 + \frac{1}{9}$, and this yields the area 
of the circle to be 3.9787159...; for the unit square the area of the prescribed 
circle will be 0.99467897 ..., a much more accurate value compared to the earlier 
one. 

Figure 9: Manava construction for circling the square. 
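
The numbers for this construction can be reproduced by placing the square of side 2 with its centre at the origin. The coordinates and variable names below are assumptions made for this sketch; the interpretation coded is the one given in the text (one fifth of the segment is kept).

```python
import math

half_side = 1.0                    # square of side 2 centred at the origin
outer_radius = math.sqrt(2)        # circle through the vertices of the square
line_height = half_side / 3        # trisecting line, measured from the centre

# Segment of the line between the side of the square and the outer circle.
segment = math.sqrt(outer_radius**2 - line_height**2) - half_side
# Keep one fifth of the segment, measured from the square.
x = half_side + segment / 5
radius_squared = x**2 + line_height**2

area_side_2 = math.pi * radius_squared
area_unit = area_side_2 / 4        # rescaled to the unit square
print(area_side_2, area_unit)
```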

The wording of the sütra however leaves a lot to be desired from the point of view 
of clarity. Firstly, rather than asking the square to be cut into three equal parallel 
strips the sütra asks it to be divided into 9 equal squares, via trisection along both the 
sides; presumably this is on account of a preoccupation with symmetry. Secondly, 
there is no mention of the outer circle. This seems to have led the authors of [19] 
to give up on the meaning of what follows, and content themselves with presenting 
only an ad hoc translation, with the comment as noted. In [11] the author has been 
prescient of the unmentioned outer circle and that a portion of the segment between 
the square and the circle is involved. But here there is confusion in interpreting the 
second half of the sütra. This part has a clause lumpetpuriseneh which translates 
as “cover it with loose earth”, meaning obliterate it (a part of the segment as 
above), but there is ambiguity about what part is to be obliterated; it could refer to 
obliterating $\frac{1}{5}$th part keeping the rest, or the other way around. In [11] the former 
interpretation is adopted, which may be appropriate linguistically, but is not what 
is actually intended. From the whole context it is clear that the remaining $\frac{4}{5}$th part 
of the segment was meant to be obliterated; it would be too much of a coincidence 
otherwise for the value to come out as close as seen above. It may be borne in mind 
that at the time of the Sulvasütras these shortcomings in the presentation would not 
have mattered, as the correct meaning would have been handed down orally. The 
text was intended only to serve as an aide-mémoire. In particular the unmentioned 
outer circle would have been presumed on account of the analogous one in the earlier 
construction. 


28 S. G. Dani 




4. Squaring the Circle 


The problem of “squaring the circle”, viz. finding a square with the area of a 
given circle, is also considered in the Sulvasütras. Baudhayana Sulvasütra, which 
has the best treatment of the issue from among the Sulvasütras, deals with it, 
not in terms of a geometric construction as with the converse, but by relating 
numerically the side of the desired square to the diameter of the given circle; 
it has been argued in [9] that sutra 3.2.10 from Manava is in fact a geometric 
construction for squaring the circle, which we shall discuss later in the section. 
Baudhayana gives two formulae in this respect, the second being qualified as an 
approximate one. The first of them we shall discuss in some detail below. The 
second consists of taking $\frac{13}{15}$th of the diameter of the given circle for the side of the 
desired square. Curiously, only this more approximate formula has been described 
in Apastamba and Kātyāyana Sulvasütras for the purpose. This formula is very 
crude: for the circle of unit radius it gives the square with side $\frac{26}{15}$, whose area would 
be $\frac{676}{225}$ = 3.0044444 . . ., in place of the correct value 3.1415927..., smaller by 
more than 4%. Even in terms of fractions with small denominators there are better 
approximations possible. The Egyptians took $\frac{8}{9}$th of the diameter as the side of the 
corresponding square (see [8], Chapter 13). This gives for the unit circle the square 
with area 3.1604938 . . . , which is more than the actual value by just about 0.6%. 
Another simple fraction, $\frac{7}{8}$ as the ratio of the side of the square to the diameter, 
also yields a better result: for the unit circle the area of the square would be 3.0625, 
which is about 2.5% less than the actual value. Incidentally, $\frac{7}{8}$ would have been a 
natural choice for Baudhāyana, in giving an approximation, since as we shall see 
below it is a component of the refined value given by Baudhāyana. It would seem 
that $\frac{13}{15}$ was the traditional value (going farther back in time) and even after the 
refined formula was discovered the traditional value continued to be quoted without 
modification. 
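
The three side-to-diameter ratios compared above can be tabulated as follows. This is a small sketch for a circle of unit radius; the helper `square_area` is a name introduced here, not anything from the sources.

```python
from fractions import Fraction
import math

def square_area(ratio: Fraction) -> Fraction:
    """Area of the square whose side is `ratio` times the diameter (= 2)."""
    return (2 * ratio) ** 2

for ratio in (Fraction(13, 15), Fraction(8, 9), Fraction(7, 8)):
    area = square_area(ratio)
    error = (float(area) - math.pi) / math.pi
    # Ratio, exact area of the square, and signed error relative to pi.
    print(ratio, area, f"{100 * error:+.1f}%")
```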

In the construction of the altars there does not seem to be any occasion to convert a 
circle into a square. This would suggest that the relation as above was recorded in the 
Sulvasütras (all four) only to convey a general sense of magnitude (and perhaps in 
deference to tradition). On the other hand the other value described in Baudhayana, 
that was referred to above and we will now discuss, is apparently determined with 
serious effort. 

The refined formula for the ratio of the side of the square to the diameter of the 
circle with the same area is described as follows: divide the diameter into 8 parts, 
divide one of them into 29 parts and remove 28 of the parts and from the remaining 
one remove a sixth part and include an eighth of it; thus it is 

$$\frac{7}{8} + \frac{1}{8 \times 29} - \frac{1}{8 \times 29 \times 6} + \frac{1}{8 \times 29 \times 6 \times 8}.$$ 

The fraction is $\frac{9785}{11136}$ and for the circle with unit radius yields the area 3.0883265 ..., 
a little more than 98.3% of the actual value. 
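
The verbal prescription can be followed step by step with exact fractions. The sketch below is one reading of the wording quoted above, with variable names chosen for this illustration.

```python
from fractions import Fraction

part = Fraction(1, 8) / 29     # one of the 29 parts of an eighth of the diameter
# Remove 28 such parts; from the remaining one remove a sixth,
# then restore an eighth of that sixth.
ratio = 1 - 28 * part - part / 6 + part / 6 / 8

area = (2 * ratio) ** 2        # square for a circle of unit radius (diameter 2)
print(ratio, float(area))
```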

A discerning reader may notice that Baudhayana's construction of the circle with 
the area of a given square produced a circle whose area was bigger by about 1.7% 
while the above prescription in the opposite direction produces a square which is 
smaller in size by about 1.7%. In other words if one started with a square, followed the 
construction as in the last section to get a circle, and then used the above formula to get 
back the square, it would be very close to the original. A more precise computation 
(in place of what is pointed out as something to be noticed at first glance) shows 
that if one starts with the unit square, transforms it into a circle and then transforms 
that into a square as indicated, one gets a square with side 1.0000024 . . . (!). This 
is no coincidence. The desired fraction as above was evidently computed as the 
inverse of the ratio involved the other way, namely in circling the square, or else we 
could not have got such a close answer. If the computation of the inverse could have 
been exact the above number would have come out to be exactly 1. However in the 
computation of the inverse certain approximations had to be made, leading to the 
small difference. 
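
The round trip is easy to check numerically. In this sketch the circle is obtained from the unit square by the construction of the last section and the refined ratio is then applied to its diameter; both the side and the area of the resulting square are printed. The variable names are assumptions of this illustration.

```python
from fractions import Fraction
import math

circle_diameter = (2 + math.sqrt(2)) / 3   # circling the unit square
ratio = Fraction(9785, 11136)              # refined side-to-diameter ratio
square_side = circle_diameter * float(ratio)

print(square_side)        # side of the square recovered from the circle
print(square_side**2)     # its area
```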

It may be recalled that for the unit square the diameter of the corresponding 
circle obtained by the Baudhayana method would be $(2 + \sqrt{2})/3$. For $\sqrt{2}$ 
Baudhayana has an (approximate) formula, which we shall discuss in detail in 
the next section, associating with it the value $1 + \frac{1}{3} + \frac{1}{3 \times 4} - \frac{1}{3 \times 4 \times 34}$, and the ratio 
$(2 + \sqrt{2})/3$ to be inverted would thus be $1 + \frac{1}{3 \times 3} + \frac{1}{3 \times 3 \times 4} - \frac{1}{3 \times 3 \times 4 \times 34}$ (expressed in 
the style of the Sulvasütras). By our current method we would write the number 
as $\frac{1393}{1224}$ and produce $\frac{1224}{1393}$ as the inverse. However, this method may not have been 
known at that time. Thus they either did not identify the number as $\frac{1393}{1224}$ or did 
not consider the reciprocal $\frac{1224}{1393}$ for the inverse formula, for one or other reason, 
including possibly that it cannot be expressed in terms of making a smaller number 
of parts as in the above expressions, as $1393 = 7 \times 199$ has the large prime factor 199. 
Instead their computation seems to have gone along the following lines. A part $\frac{7}{8}$ 
was first separated as the significant part; this could be either following a way that 
they may have had to find the inverse of a fraction expressed as a sum of the terms 
as above, or could be an ad hoc choice noticing that the inverse should be greater 
than that. The remaining part after separating out $\frac{7}{8}$ turns out, in our notation, to 
be $\frac{41}{8 \times 1393}$. They however opted to replace it by $\frac{41}{8 \times 1392}$, which could be split up as 
$\frac{1}{8 \times 29} - \frac{1}{8 \times 29 \times 6} + \frac{1}{8 \times 29 \times 6 \times 8}$, thus arriving at the quadrature formula. 

It does not seem to have been noted in the earlier literature, but it turns out that 
taking only the first two terms in Baudhayana's expression yields a better result. 
The fraction would then be $\frac{51}{58}$, which for the circle with unit radius yields the area 
3.0927467 . . . , closer to the actual value $\pi$ = 3.1415927... compared to the value 
3.0883265 ... obtained above. Thus the additional two terms in fact worsen the 
result in terms of the correct value for the area of the circle. Of course this could 
not have been anticipated by Baudhayana. Interestingly, however, if the alteration 
of 1393 in the denominator to 1392 were done before separating out the part $\frac{7}{8}$ then 
they would have 'accidentally' got the inverse to be $\frac{51}{58}$ as above. This however is 
not as close to the inverse of $(2 + \sqrt{2})/3$ as Baudhayana’s value was seen to be, 
which would have been the objective at that time. 
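
The arithmetic of this suggested route can be retraced with exact fractions. The sketch below only checks the identities involved; it does not claim to reproduce how the computation was actually carried out, and the variable names are assumptions of this illustration.

```python
from fractions import Fraction

# The Sulvasutras value for the square root of 2.
sqrt2 = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)
ratio = (2 + sqrt2) / 3        # diameter of the circle for the unit square
inverse = 1 / ratio            # exact reciprocal, in modern terms

remainder = inverse - Fraction(7, 8)     # what is left after separating 7/8
adjusted = Fraction(41, 8 * 1392)        # denominator 1393 replaced by 1392
split = Fraction(1, 8 * 29) - Fraction(1, 8 * 29 * 6) + Fraction(1, 8 * 29 * 6 * 8)

print(ratio, inverse, remainder)
print(adjusted == split)
# First two terms only, and the 'accidental' value 1224/1392.
print(Fraction(7, 8) + Fraction(1, 8 * 29), Fraction(1224, 1392))
```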

The choice of $\frac{7}{8}$ as the part to be separated is interesting, especially if it was ad 
hoc. Why was it not chosen instead to be $\frac{13}{15}$, the supposed approximate value? Was 
it known that it was better than the latter, even though it had not been stated? Why 
did it then continue to be unstated even after the refined formula was exposited? 
Or were they from two different substreams of thought, even though they occur 
alongside each other in Baudhāyana? 

Let me now come to the sutra from Manava Sulvasütra, sūtra 3.2.10, to which 
a reference was made in the beginning of the section. Some developments around 
it are rather puzzling. Though the sutra is rather condensed, an overall reading 
readily indicates that it has something to say on the question of squaring the circle, 
as also the converse. However various commentators, including van Gelder who 
produced the first edited version and English translation, have interpreted it to 
relate to only the converse problem of circling the square, in a way equivalent to 
that in Baudhayana. A new interpretation was given by Hayashi in [9], after some 
emendation of the original text which has been justified on grounds of rectification 
of possible corruption of the manuscript over a period. According to [9] both the 
problems of circling the square and squaring the circle are addressed in the sutra. 
The square with the area of a given circle is described to be of side equal to the height 
of the equilateral triangle over the diameter of the circle; the converse statement 
continues to be interpreted to be essentially as in Baudhayana, but there is also a 
slight variation offered (attributed to the referee of the paper). It may be observed 
that for the circle with unit radius the side-length of the stipulated square would be 
$\sqrt{3}$, which yields the area of the square to be 3, in place of $\pi$ = 3.1415927..., less 
by about 4.5%. This is a rather crude value, cruder than even the approximate 
value given in Baudhayana and stated also in Apastamba and Katyayana. In the 
context of the improved result from Manava Sulvasütra that we saw in the last 
section, both direct and converse statements as above seem surprising; the first part 
on account of the crudity of the value and the second being not as sharp as the 
improved result that we noted. However such anomalies seem to be inherent in 
Sulvasütras, as we have seen elsewhere also, since they continued to exposit the 
traditional values of their time with no critical evaluation, even after better results 
became known. It may be noted in this context that the sutrakara characterises the 
contents of the above sutra, by a mention in the preceding sutra, as “prescriptions of 
informed persons”, traditional knowledge in other words. The refined method for 
circling the square described in the preceding section comes later (though it is not 
clear if much can be read from the sequencing) and is presumably either due to the 
author himself or more contemporary knowledge of his time. 

While we are discussing the area of the circle the reader may wonder what the 
Sulvasütras had to say about the circumference of the circle. In Baudhayana there 
is only an incidental reference to it, where a circular pit “with diameter 1 pada and 
circumference 3 padas” is mentioned, indicating that the circumference was taken 
to be 3 times the diameter; interestingly this coincides with the value according to 
the Bible. In Manava Sulvasütra however the circumference is stated to be $3\frac{1}{5}$ times 
the diameter (sutra 10.3.2.13 as in [11]; the interpretation of the sutra in [11] as 
describing the side of the square is incorrect); the statement of the ratio is followed 
by “not a hair-breadth remains”! Evidently a lacuna in the classical value had been 
overcome and it was worth exulting over it! 


5. The Square Root of 2 


Three of the four Sulvasütras, Baudhayana, Apastamba and Katyayana, describe 
the following formula for $\sqrt{2}$ (in words) expressing the value as 

$$1 + \frac{1}{3} + \frac{1}{3 \times 4} - \frac{1}{3 \times 4 \times 34}.$$ 


The sūtra is followed by savisesah in Baudhayana, sa visesah in Apastamba and sa 
visesa iti visesah in Katyayana. The word visesah means "extra" and the wording in 
Katyayana Sulvasütra conveys that the number is in excess of the desired quantity 
(cf. [12]). visesah was also used as a technical term for the excess of the diagonal 
over the original measure, and savisesah thus signified “together with the extra", 
namely the diagonal itself. As such the term itself is not connected with the value 
being considered to be in excess, or approximate. 

Though we would not know how they obtained it, such a complex expression 
would have been arrived at by some process (and not directly by intuition or 
guesswork). Whatever the process, along the way it would have been noticed that 
the targeted magnitude has not been reached exactly. Thus it would have been 
known, at least at the time when the value first came to be assigned, that it was not 
exact. (It is also tempting to argue that they could have squared the number and 
found that it is not 2, but it is doubtful if they would have taken up such an arithmetic 
exercise). 

In decimal expansion the value of the expression is 1.4142157..., which is 
remarkably close to the actual value 1.4142136.... The closeness is striking. A 
few points should be noted in this respect however. Firstly, it may be recalled 
that the Babylonian cuneiform tablet YBC 7289 (Yale Babylonian Collection), 
dating back to around 1800 to 1600 BCE, also contains an approximate value 
for $\sqrt{2}$, in the sexagesimal system as 1;24;51;10, which yields the value 
$1 + \frac{24}{60} + \frac{51}{60^2} + \frac{10}{60^3}$ = 1.4142129.... Furthermore the Babylonian value, described in a 
pre-existing standardised system with uniform subdivisions, namely the sexagesimal 
system of representation, carries an intrinsic sense of the level of accuracy. The 
Sulvasütras value however is evidently the output of a specific process, and there 
is no intrinsic gauge of accuracy involved. No details of how the procedure went 
are found in the Sulvasütras but some likely methods have been proposed, that we 
shall discuss below. In the context of these it would seem that while refining on the 
approximation $1 + \frac{1}{3} + \frac{1}{3 \times 4} = \frac{17}{12}$ = 1.4166667 . . . , the next step took them directly 
to the above fine value due to the special nature of the number $\sqrt{2}$; it could 
thus have been fortuitous to have got so close to the actual value. 
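
The two historical values can be set side by side as exact fractions. This is a minimal sketch; the labels and variable names are chosen here for illustration.

```python
from fractions import Fraction
import math

sulva = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)
babylonian = 1 + Fraction(24, 60) + Fraction(51, 60**2) + Fraction(10, 60**3)

for name, value in (("Sulvasutras", sulva), ("YBC 7289", babylonian)):
    # Exact fraction, decimal value, and signed deviation from sqrt(2).
    print(name, value, float(value), float(value) - math.sqrt(2))
```

The first value lies slightly above $\sqrt{2}$ and the second slightly below it.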

Actually there is also no indication that they had serious interest in accuracy 
of that level. No other nonexact relation found in the Sulvasütras is that close. It 
may be recalled that even when better values were available in the case of the 
quadrature formula, for instance, only a rough approximation was recorded in 
the later Sulvasütras. For $\sqrt{2}$, Manava, long after Baudhayana, proffers only the 
approximate value 17 7. Thus while they were deeply preoccupied with geometric 
aesthetics, high accuracy is hardly seen to be considered seriously as an 
issue to be pursued. 

There have been various suggestions on how the formula may have been arrived 
at. One convincing possibility put forth by B. Datta [4] (see also [19]) goes as 
follows. The presentation here is essentially along the lines of [19] with some 
modifications for clarity. We have two unit squares to be put together and formed 
into a square. Divide the second square into three equal parts by lines parallel to 
one of the sides. Two of these you attach at the sides of the first square sideways; 
this produces a gnomon (see Figure 10). Divide the third part into a square of side 
$\frac{1}{3}$ and a rectangle of sides $\frac{1}{3}$ and $\frac{2}{3}$. The square part is used to convert the gnomon 
into a square of side $1 + \frac{1}{3}$. The rectangle is divided into four equal parts along the 
length, into rectangles with sides $\frac{2}{3}$ and $\frac{1}{3 \times 4}$. Lengthwise two of these strips can 
be placed along each of the two sides of the square formed earlier, side by side. We 
thus get a square except for a missing little square at the corner, of side $\frac{1}{3 \times 4}$. 

Now what we need is a square which is a difference of a square with side 
$1 + \frac{1}{3} + \frac{1}{3 \times 4} = \frac{17}{12}$ and a small one of side $\frac{1}{3 \times 4}$. If strips of width $\frac{1}{3 \times 4 \times 34}$ are cut out 
along the two sides, that would adjust for the square to be removed, except for a tiny 
square with side $\frac{1}{3 \times 4 \times 34}$ formed of the overlap of the two strips. The latter is ignored, 
and the square formed by cutting out the strips is produced as the answer. 
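
The areas in this dissection can be tracked with exact fractions. The sketch below follows the steps as described; the variable names are assumptions of this illustration, and the strip widths are derived from the areas rather than taken as given.

```python
from fractions import Fraction as F

strip = F(1, 3)                   # width of each third of the second unit square
side1 = 1 + strip                 # gnomon completed by the 1/3 x 1/3 square
rect_area = F(1, 3) * F(2, 3)     # what is left of the third strip
thin = rect_area / (2 * side1)    # width of the strips laid along two sides
side2 = side1 + thin              # outer side, with a corner square missing
assert side2**2 - thin**2 == 2    # the figure so far has area exactly 2

cut = thin**2 / (2 * side2)       # width to cut from two sides for the corner
answer = side2 - cut
print(thin, side2, cut, answer)
print(answer**2 - 2)              # excess area: the ignored overlap, cut**2
```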


Figure 10: Possible route to the formula for $\sqrt{2}$. 


It has been argued in [7], convincingly, that the Babylonian value of $\sqrt{2}$ (noted 
above) would have been obtained by a procedure involving the following idea. 
Suppose we have a square with a given sidelength and we would like to get a 
square with an area larger by a certain amount which is relatively small compared 
to the area of the original square. The difference between the desired new square 
and the original one is seen to consist of two (long) rectangles and a small square 
(the reader may see Figure 6 for reference); the long side of the rectangles has the 
original sidelength and the other side corresponds to the desired increment. As the 
remaining square is small, the desired increment in the area divided by twice the 
original sidelength would be a good approximation for the requisite increment in 
the length; the area to be added is thus put in the two rectangles themselves. In 
modern notation, if $l$ is the original sidelength and $a$ is the amount of area to be 
added, then $l + \frac{a}{2l}$ is an approximation to $\sqrt{l^2 + a}$; this may be compared with 
the binomial expansion. Similarly, if we have a square of a given sidelength and 
wish to get a square whose area is smaller by a certain amount, then the latter 
divided by twice the length of the original square gives an approximation to the 
requisite reduction in length; this corresponds to $l - \frac{a}{2l}$ as an approximation to 
$\sqrt{l^2 - a}$. 
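
The rule in modern notation can be tried on a simple case. The function names `enlarge` and `shrink` and the sample numbers below are assumptions made for this illustration.

```python
import math

def enlarge(l: float, a: float) -> float:
    """Approximate side of a square of area l*l + a, for a small relative to l*l."""
    return l + a / (2 * l)

def shrink(l: float, a: float) -> float:
    """Approximate side of a square of area l*l - a, for a small relative to l*l."""
    return l - a / (2 * l)

# Each line shows the approximation next to the true square root.
print(enlarge(10, 1), math.sqrt(101))
print(shrink(10, 1), math.sqrt(99))
```

In both cases the approximation errs on the high side, since the small corner square is neglected.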

It is illustrated in [7] in detail how the Babylonians may have arrived at their 
value of $\sqrt{2}$ using the idea, in two steps: Starting with a square with sidelength $\frac{3}{2}$ and 
seeking to get a square with area 2, the side is reduced by $\left(\left(\frac{3}{2}\right)^2 - 2\right) / \left(2 \times \frac{3}{2}\right) = \frac{1}{12}$, to 
$1\frac{5}{12}$ (in [7] the details are explained in the Babylonian sexagesimal notation, but we 
shall not go into it here). This is still a larger value and a second reduction is effected 
to get a second approximation to $\sqrt{2}$. The fraction that one is after for reduction in 
the second step would be (in our notation) $\left(\left(\frac{17}{12}\right)^2 - 2\right) \times \frac{6}{17} = \frac{1}{3 \times 4 \times 34}$, same as the 
last term in the Sulvasütras expression. The Babylonians did not deal with fractions 
directly however, and the desired numbers were found in sexagesimal points (as 
our calculators would do in decimal points) and that led them to the approximation 
1; 24; 51; 10 for $\sqrt{2}$, in sexagesimal notation, as noted above. The possible routes to 
this have been discussed in [7]. Interestingly (and rather fortuitously!) 1; 24; 51; 10 
is closer to $\sqrt{2}$ than $\frac{17}{12} - \frac{1}{408}$, which was the approximation sought to be computed 
(according to [7]). 
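
For comparison, the sexagesimal digits of the sought approximation can be computed exactly. The helper `sexagesimal` is a name introduced for this sketch, and truncation after three places is an assumption of the illustration, not a claim about how the tablet's digits were actually obtained.

```python
from fractions import Fraction

def sexagesimal(x: Fraction, places: int = 3) -> list[int]:
    """Integer part followed by `places` base-60 digits of x, truncated."""
    digits = [x.numerator // x.denominator]
    frac = x - digits[0]
    for _ in range(places):
        frac *= 60
        digit = frac.numerator // frac.denominator
        digits.append(digit)
        frac -= digit
    return digits

target = Fraction(17, 12) - Fraction(1, 408)
print(sexagesimal(target))
```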

A procedure as above is well within the scope of the thought process involved 
in the Sulvasütras, and it could well have been known; it is also possible that the 
idea may have come down from the Babylonians, but there has been no supporting 
evidence to that effect. In the case of Sulvasütras the approximation as above may 
have been obtained from the above procedure by starting with a square of sidelength 
$1\frac{1}{3}$. This is a smaller square and the increment in the side is 
$\left(2 - \left(\frac{4}{3}\right)^2\right) \times \frac{3}{8} = \frac{1}{3 \times 4}$, 
which yields the first approximation $\frac{17}{12}$. In the second step this would lead to the 
quoted value, as seen above. This seems to fit better with the way the number is 
expressed. 

It has been pointed out in [7] that the approximation as above is mathematically
equivalent (emphasis as in original) to Heron's method for extraction of square
roots, which involves the following: Let $A = l^2 + a$. If $l$ is a (first) approximation
to $\sqrt{A}$, $A/l$ also approximates $\sqrt{A}$, and moreover if $l < \sqrt{A}$ then $A/l > \sqrt{A}$
and vice versa. Therefore $\frac{1}{2}(l + \frac{A}{l})$ is a candidate for a better approximation. But
$\frac{1}{2}(l + \frac{A}{l}) = l + \frac{1}{2l}(A - l^2) = l + \frac{a}{2l}$, the approximation seen earlier. It may therefore
seem tempting to argue that the approximate expression for $\sqrt{2}$ was obtained via
such averages. However, arriving at it in this way involves a kind of familiarity
with algebra that is not evidenced in the Śulvasūtras; [7] does not seem to suggest
it in respect of the Babylonians either.
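
The algebraic identity behind this equivalence can be confirmed numerically. The sketch below uses hypothetical function names and a few arbitrarily chosen starting values; it only illustrates that the average of $l$ and $A/l$ coincides with $l + \frac{a}{2l}$.

```python
from fractions import Fraction

def heron_step(l, A):
    """Average of the guess l and A/l (Heron's method)."""
    return (l + A / l) / 2

def side_correction(l, A):
    """The geometric rule: add a/(2l), where a = A - l^2."""
    a = A - l * l
    return l + a / (2 * l)

A = Fraction(2)
guesses = [Fraction(1), Fraction(4, 3), Fraction(3, 2), Fraction(17, 12)]
for l in guesses:
    # Both rules give the same exact fraction for every starting value.
    assert heron_step(l, A) == side_correction(l, A)
    print(l, "->", heron_step(l, A))
```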

Some authors have also brought in the more elaborate expression
$l + \frac{a}{2l} - \frac{(a/2l)^2}{2(l + a/2l)}$ for $\sqrt{l^2 + a}$ ([15] or [19], and [2]) as explanation for the
approximations for $\sqrt{2}$ in the Śulvasūtras. Some of the other suggestions, including
one by Thibaut, who produced the first English edition of Baudhayana supplementing
it with his commentary, also involve taking difference of squares (see
[20] or [19]), relying on some arithmetic. These explanations however do not
seem natural for the context, and seem to depend on ideas that evolved only in a
later period, unlike the geometrical explanations above which are consistent with
the Śulvasūtras style.
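
For reference, the more elaborate expression can be evaluated exactly. The sketch below assumes the starting value $l = \frac{4}{3}$ (so $a = \frac{2}{9}$); this choice is made here for illustration and is not attributed to any of the cited authors. The function name `second_order` is hypothetical.

```python
from fractions import Fraction

def second_order(l, a):
    """l + a/(2l) - (a/(2l))^2 / (2 (l + a/(2l))), as an exact fraction."""
    c = a / (2 * l)
    return l + c - c * c / (2 * (l + c))

l = Fraction(4, 3)   # assumed starting side
a = 2 - l * l        # 2/9, the area still to be added
value = second_order(l, a)

# The Sulvasutra expression 1 + 1/3 + 1/(3.4) - 1/(3.4.34).
sulva = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)

print(value, sulva)
```

With this starting value the two printed fractions agree, which shows only that the expression reproduces the number, not that it was the route taken.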

Here is another possibility which I would like to propose, taking the context of
the Śulvasūtras into account. Having found the need for a numerical value for the
diagonal of the square (I will argue below that this was the case), one may simply
choose to find it by measurement with the ropes. This seems natural also from the
point of view that they were using ropes for a variety of constructions. Take a rope
of length equal to the diagonal of a square. Mark out one unit from one end. By
measuring the remaining piece in terms of the unit piece one notes that the former
is more than one-third and less than half of the unit. We take away one-third of the
unit from it and consider the remaining part. It is now easier to compare the remaining
piece with the second piece, of length $\frac{1}{3}$, rather than the unit. The remaining piece
turns out to be slightly less than one-fourth of the second piece. Thus the diagonal is
slightly less than $1 + \frac{1}{3} + \frac{1}{3 \cdot 4}$.
How much less? You measure the third piece with the length that fell short. It is very
close to 34 times. Thus you decide to take away $\frac{1}{34}$th of the last piece, which gives
the formula. One may start with a rope of length 35 feet for the unit, so the required
total rope length, namely the length of the diagonal, is about 50 feet. In this case the
length of the final remaining piece as above will be over an inch and the penultimate
piece to be measured with it would be about 3 feet, comfortable sizes for manual
measurements, establishing that it is almost exactly 34 times. Note that use of over
50 feet long ropes was quite common in the Śulvasūtras constructions; the rope
involved in the ekarajjuvidhi for the construction of the Mahāvedi measured over
100 feet. There are some variations possible over the above theme, and though one
may ask why this one was picked, the answer can simply be that through trials this
would have been found most convenient.
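
The sizes quoted in this proposal can be reproduced with a short calculation. This is a sketch in floating-point arithmetic with hypothetical variable names, using the 35-foot unit mentioned above; it models the arithmetic only, not the practical handling of ropes.

```python
from math import sqrt

unit = 35.0                    # feet, the side of the square
diagonal = unit * sqrt(2)      # total rope length, a little under 50 feet

rest = diagonal - unit         # what remains after marking off one unit
third = unit / 3               # the "second piece"
assert third < rest < unit / 2 # more than a third, less than half of the unit

rest2 = rest - third           # what remains after removing one-third
twelfth = third / 4            # the "third piece", a fourth of the second
assert rest2 < twelfth         # slightly less than a fourth of the second piece

shortfall = twelfth - rest2    # the length that fell short
ratio = twelfth / shortfall    # how many times the shortfall fits in the third piece

print(f"third piece: {twelfth:.2f} ft")
print(f"shortfall:   {shortfall * 12:.2f} in")
print(f"ratio:       {ratio:.2f}")
```

The printed ratio shows how close the comparison comes to the figure of 34 used in the formula.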

What was the motivation of the sūtrakāras for determining the value of $\sqrt{2}$?
We have seen before that $\sqrt{2}$ is involved in the Śulvasūtras as the diagonal of
the unit square. It is called dvikaraṇī. As a magnitude it was involved in doubling
the area of the figures to be drawn and, as mentioned earlier, this was achieved by
enhancing the reference unit to the size of the diagonal of the corresponding unit
square, and following the same steps of construction as before, for the figure to be
drawn. For this purpose however one does not actually need the numerical value
of $\sqrt{2}$. Nor would the complex formula be of much help in producing a mark for
dvikaraṇī, as it would involve the cumbersome process of subdividing the segments
into as many as 34 parts!* It would be much simpler to draw a square and mark its
diagonal on the rope. The numerical value was needed on the other hand in deriving
the quadrature formula, in order to find the inverse of $(2 + \sqrt{2})/3$, as seen in the
last section. As noted by Seidenberg [17] almost certainly this problem served as the
motivation for finding a numerical value for $\sqrt{2}$. Of course, as the reader may recall
from the earlier sections, the later Śulvasūtras, Apastamba and Katyayana, included
the value of $\sqrt{2}$ but not the refined quadrature formula as in Baudhayana for which
the value of $\sqrt{2}$ was used. This could however be simply due to the fact that once
the value was found it may have seemed appealing to the later authors.

* In [10] it is stated (on page 361), referring to the papers of Albert Bürk on Apastamba Śulvasūtra, that
the approximate formula for $\sqrt{2}$ was used for the construction of a square. The square that is referred
to is in fact the one described towards the end of Section 1, constructed by the Nyanchana method with
the triple $(1, 1, \sqrt{2})$. For the length $\sqrt{2}$ the original sūtra asks to use the savisesah. This apparently has
been interpreted by these authors as using the approximate value for it rather than the diagonal itself. However
savisesah stood for diagonal as well, and there is no reason to suppose that in the above construction they
followed the cumbersome procedure involving the formula for $\sqrt{2}$ to fix the length, rather than using the
diagonal itself, in the manner I indicated.

There has been unwarranted speculation in the literature, imputing knowledge
of the irrationality of $\sqrt{2}$ to the sūtrakāras (see [4], the discussion on p. 195).
They would have been aware that the value they gave was an approximate one,
and that is the closest it gets. To infer from this the knowledge of irrationality
of $\sqrt{2}$ is to miss or to trivialise the significance of the notion of irrationality.
Irrationality is not just about observing that attempts to express a number as a
fraction yield only approximate results (even this cannot be said to be subsumed
in its entirety in the Śulvasūtras as there is but one approximate value described),
but to be able to assert, with valid argument, that it is impossible to express it
as a fraction. The Śulvasūtras were concerned with practical aspects of geometry
of altar building, and some of it they abstracted, developing a theoretical
understanding. One does not find them getting involved with philosophical or
arithmetical aspects of numbers, and there is no point in indulging in wishful thinking
about it.
On the Origins of Zero


Frits Staal 


1. Indic Origins? 


IT IS WIDELY believed that zero originated in Indic Civilization but the evidence 
in support of that belief is not only meager; it is almost zero. No place or time, let 
alone the name of a discoverer or inventor, has ever been suggested. How can we 
handle such a problem? We must start from the beginning. 

Indic Civilization starts with the Indus Civilization which is earlier than the 
Vedas. Its inscriptions exhibit occurrences and sequences of circles that resemble 
the numerals that have expressed zeroes in more recent times; to be a little more 
precise: more than three thousand years later. Other civilizations roughly contem- 
porary with the urban complexes of Mohenjo-daro and Harappa used circles also, 
but they did not refer to zeroes. On cuneiform tablets from Uruk in Mesopotamia, 
dating to 3,000 BCE, circles refer to the number “2” (Tropfke 1980: 29). It weak- 
ens the suggestion that the Indus circles were expressions of zero. A more serious 
difficulty is that the language of the Indus inscriptions is not known. It is not even 
clear that it was a language and its uses are controversial. Some of these topics are 
discussed in Staal 2008: 7-11, which provides the evidence and further references. 

With Vedic mathematics we are on firmer ground. We find not only geometry 
but integers, a rudimentary decimal system for counting. It did not include nega- 
tive numbers, but “addition, subtraction and perhaps multiplication of whole num- 
bers" (Hayashi 2003: 360-61). The Rigveda made use of recursion (Divakaran, 
forthcoming). It did, moreover, distinguish between cardinal and ordinal numbers 
(Renou: 1964: 92; Staal 2007: 589-590; 2008: 272-273). In all these cases we are 
dealing with numbers, not with numerals: the Vedas are an Oral Tradition since 
there was no writing on the subcontinent prior to the Buddhist Emperor Aśoka who
reigned from 268 to 231 BCE. But "zero" did not only lack a symbol. There was 
no term for it in the oral tradition. The word kha, which Indian mathematicians 
used later to denote zero, occurs in Vedic only in the senses of “hole”, “opening”, 
“vacancy” or "space". 

Counting boards based upon the decimal system took another step but the Indic 
evidence is of later date and the empty spaces are zeroes of a kind, not symbolic 
expressions. Even today, the Indian pandita or traditional scholar uses neither 
counting boards nor books: he carries his knowledge in his head. Writing had
its commercial uses, manuscripts go back to the beginnings of the Common Era, 
but it began to be used widely only after the invention of Hinduism in the early 
nineteenth century (Staal 20082). 

Fortunately, it is not the end of our story since those who look for the origins of 
something, even if it is zero, must look beyond the domain where it is customarily 
located, even if it is absent. The remaining parts of my essay attempt to do so. Part 
2 will discuss two pioneering investigations. Parts 3 and 4 will take us beyond the
history of mathematics. Part 3 will pay attention to linguistics and Part 4 to the
Vedic theory of ritual, not included in modern classifications or curricula though 
regarded during the period of middle Vedic as an exact science. Having gone that 
far we must recall that “exact” and “science” are often no more than labels and
that all names of disciplines are due to us, not to the universe to which we belong. 


2. Khmer and the Buddhist Madhyamaka 


Two original contributions, undigested legacies from the twentieth century, have 
to be taken into account if we wish to understand how zero may or may not have 
been discovered. The first is due to Joseph Needham, the famous scholar and 
scientist who published, together with his collaborators, the many volumes of Sci- 
ence and Civilization in China. The second is due to David Ruegg, a brilliant 
Sanskrit scholar whose early work dealt with the Sanskrit grammarians but whose 
chief contributions since have been to Buddhist Studies, primarily as expressed in 
Sanskrit and Tibetan sources. 
Needham presented his ideas in the third volume of his series. It is entitled
"Mathematics and the Sciences of the Heavens and the Earth" and was first
published in 1959. It is important to understand what Needham tried to do for he
has been criticized and misunderstood like other pioneers. He did not examine 
Chinese sciences from a simple “evolutionist” concept of history, as if they were 
“more or less clumsy attempts to express modern scientific ideas,” a notion that 
Pingree wisely rejects in another context (Pingree 2003: 45). But neither did 
Needham present Chinese sciences from “the Chinese point of view,” whatever 
that is, as Seyyed Hossein Nasr (1968: 21) tried to do with respect to the Islamic 
Sciences. Needham’s perspective is different and he has expressed it in unambigu- 
ous terms: “to write the history of science we have to take modern science as our 
yardstick—that is the only thing we can do—but modern science will change and 
the end is not yet" (Needham 1976: xxxi with further discussion in Staal 2006: 
91-97). 

D. J. de Solla Price, a historian of science at MIT, described Needham's work 
as follows: “In my estimation, the essential contribution made thus far by the 
six volumes of Science and Civilization in China lies in the systematization and 
presentation in English translation or summary of the substantive content of the 
otherwise ill-digested bulk of Chinese scientific and technical literature. Here we
have the raw material on which generations of later scholarship can be founded. 
Here at last we have some map to tell us where to look, and some indication at 
least of what we shall find" (1971: 17-18). 

Needham's volumes deal with much more than the Chinese sciences. They 
abound in references to Indian, Mesopotamian and European disciplines. The dis- 
cussion on the origin of zero begins to meander in Volume 3 on page 9. Needham 
knows, of course, that zero is widely believed to have originated in Indic Civiliza- 
tion, he is familiar not only with the Chinese but also with Indic counting boards 
and then takes an unexpected turn: he zeroes in on early South East Asian inscrip- 
tions. The thesis that emerges will not surprise us but is not formulated in a few 
simple sentences or in a single paragraph. We get the idea when we combine three 
separate sentences that occur on pages 10-12 and that I quote here because they 
are Needham's own words:

• “The usual view is that the circular symbol for zero derived directly from
  India, where it first appears on the Bhojadeva inscriptions at Gwalior
  dated +870.”

• “While the first epigraphic evidence for the zero in India is, as has just been
  mentioned, of the late +9th century, it has been discovered about two centuries
  earlier in Indo-China and other parts of south-east Asia. This fact may
  be of much significance.”

• “It would seem, indeed, that the finding of the first appearance of the zero in
  dated inscriptions on the borderline of the Indian and Chinese culture-areas
  can hardly be a coincidence.”


I shall not discuss the first sentence though more recent discoveries of Indian 
inscriptions have pushed the dates further back. It neither affects Needham’s the- 
sis, nor the origins of zero as we shall see. The second sentence is based upon 
an early article published in 1931 by another celebrated scholar, George Coedès,
the French historian of South East Asia and especially of the Khmer Empire of 
what are now Kampuchea and parts of Thailand. He did not only write the classic 
account of the ‘Indianization’ of Indochina and Indonesia that was first published 
in French, often reprinted since 1964 and translated into English as The Indianized 
States of Southeast Asia. He also published eight volumes of Cambodian inscrip- 
tions, and wrote much else that inspired thousands of more recent publications, all 
of them supplementing, improving and updating his work, just as de Solla Price
had predicted of Needham’s series. As for the third of Needham’s three sentences, 
its significance depends on the significance of the second. 

Needham’s idea is based upon two inscriptions discussed in Coedès’s article.
One comes from Cambodia, the other from Indonesia. Both show the zero, one in 
the form of a dot and the other in the shape of a small closed curve that may be 
a circle. Both are dated 683 CE, almost two centuries earlier than the alleged first
Indic inscription of 870. 
Needham does not comment on the puzzling circle, if it is a circle, which comes
from the small Indonesian island of Banka, but he stumbles on both inscriptions
and then makes his fatal lapse. He assumes throughout that their language is
Southeast Asian but does not say which language it is. Is he thinking of Khmer or
Old Javanese? The simple truth is that the language of the inscriptions is expressed
in a Southeast Asian script but is none other than Sanskrit.

This was obvious to Coedès as every careful reader of his article can see. He
writes on page 325: “au Cambodge, les premières inscriptions sanskrites datées
font usage des mots symboliques”; “au Champa, les deux plus anciennes inscriptions
sanskrites datées ... en langue sanskrite”; “les inscriptions sanskrites du
Champa”; “à Java, la plus ancienne inscription sanskrite datée fait usage des mots
symboliques” (all italics mine). These quotes show that “the first appearance of
the zero in dated inscriptions on the borderline of the Indian and Chinese
culture-areas” does not only fail to be a coincidence. They demonstrate that the content of
these inscriptions is Indic.

I now come to David Ruegg who wrote a brief but substantial article partly 
concerned with our problem. It is entitled “Mathematical and Linguistic Models 
in Indian Thought: The Case of Zero and Sunyata” and was published in 1978. 
Sunyata refers, as is well known, to the Buddhist concept of ‘emptiness.’ It is
a characteristic feature of the Madhyamaka school and was foreshadowed by a
certain Bhadanta Vasumitra who may have lived at the end of the first or beginning
of the second century CE. Its context is the theory of dharmas, which do not refer
to the Buddhist dharma or ‘doctrine,’ but to “elements or factors, each of which
is considered to bear its own specific characteristic that determines it” as Ruegg
explains the expression of the Abhidharmakosa: svalaksanadharanad dharmah.

I shall not further some readers' possible annoyance with Vasumitra's Sanskrit 
but here is Ruegg's translation: “A dharma evolving in the [three] times is stated 
to be other according to the different states it enters, [the change in question being 
then] due to otherness of state (avasthantaratah) but not of substance". The words 
and phrases within square brackets are due to Ruegg, who explains the example 
that Vasumitra adds as follows: “like a marker or counter (vartika) in reckoning 
which in the unit position has the value of a unit, in the hundred's position that of 
a hundred, and in the thousand's position that of a thousand"—a straightforward 
expression of the use of zero as a place-marker in the decimal system. 

Ruegg adds that the same idea is sometimes expressed by the term gulika, “ball” 
or “bead” which, like the counting boards to which they belong, should remind us 
of the fact that mathematicians are not always concerned with what modern readers 
think of almost exclusively, viz. writing. Needham is familiar with this usage of 
sunya because he compares it to the empty spaces on Chinese counterboards in a 
long footnote on his pages 11-12. 
Ruegg discusses two terms for zero: kha, which we have already met, and
bindu, which means "dot". He draws attention to the Vasavadatta, a literary work
of the sixth century by Subandhu which uses sunyabindu to denote the symbol for 
zero. He refers next to Pingree's work on the Yavanajataka of Sphujidhvaja, then 
about to be published. That text was composed in 149/150 CE and used the term 
bindu in "the earliest reference known to the decimal place-value system with a 
symbol for zero in India" (Pingree 1978, II: 406, I: 494). 

The next topic Ruegg considers is the history of the term sunya. He starts
with the Rigveda which employs suna for “lack, absence, emptiness.” (One of
the earliest examples is “lack of sons” in RV 7.1.11.) Later Vedic has sunya in
the meaning “hollow, deserted”. After providing more information on sunyata,
he concludes cautiously that we cannot trace connections between the Buddhist 
"emptiness" and the mathematical concept of zero. 

Ruegg then turns to early linguistics and Panini's lopa. We shall look at its 
apparent invisibility in the next section. Ruegg notes its occurrence in modern 
linguistics but then diverges from our topic in grammatical and philosophical di- 
rections that involve sunyata without throwing light on the zero. His cautious 
peregrinations have inspired my own meanderings. The reader should note what 
has not been shown and remember Pingree's statement: the earliest written refer- 
ence to the decimal place-value system with a symbol for zero in India is dated to 
149/150 CE. We have to look not only beyond writing but further. 


3. Zeroes in Sanskrit Grammar 


Almost all Indic mathematicians wrote in Sanskrit, the classical language of sci- 
ence that unites the subcontinent (Staal 1995). Malayalam is among the famous 
exceptions (Divakaran 2007). Persian and English became more common in pre- 
modern and modern times but Sanskrit continued in mathematics and Jyotihsastra 
or astronomy-cum-astrology (see, e.g., Minkowski 2002, 2008). Many of the clas- 
sics of Indian mathematics were composed in concise and sometimes elegant San- 
skrit verse. Here is young Aryabhata on the subdivisions of time: 

“A solar year is a year of men. Thirty of them make an ancestral year. Multiplied
by twelve is explained as a year for the gods” (ravivarsam manusyam
tad api trimsadgunam bhavati pitryam | pitryam dvadasagunitam divyam
varsam samuddistam: 3:7). Another verse condenses the full sine table
into a single couplet (1:12). And who does not know the penultimate verse: “From the
Ocean of true and false knowledge I have, through the boat of my own knowledge,
rescued with the grace of the deity the precious sunken jewel of true knowledge”
(sadasajjnanasamudrat samuddhrtam devataprasadena | sajjnanottamaratnam
maya nimagnam svamatinava).

How did these mathematicians know Sanskrit? It could not have been their
first or native language. Many were not brahmans (Aryabhata was not or else
his name would have been Aryabhatta!*); and neither need all brahmans know
Sanskrit. Arya, moreover, does not refer to “the three twice-born classes" as it 
probably did in the Manusmrti, “The Law Code of Manu", 2.207 (Olivelle 2004: 
242). The distinction between ārya and anārya has not been a racial or ethnic
distinction. Madhav Deshpande has shown that it expresses claims to moral, social 
and spiritual status, tending toward exclusion in so-called ‘Hindu’ legal texts and 
epics, but inclusion and transformation among Jainas and Buddhists (Deshpande 
1999). 
The mathematicians who wrote in Sanskrit, then, might have come from anywhere
and their native language may be anyone's guess but they must have studied
Sanskrit grammar. It is unlikely that they studied books. In India, no pandita or
traditional scholar does. An aspiring savant may have been taught by his father or
must have had a teacher, who had his own guru, etc. in the oral succession of
guruparampara, “the lineage of teachers". All that knowledge must ultimately have
come from one of the many existing and surviving Sanskrit grammatical works. 
Two questions arise: which one did it come from and what did it say? 

Like other scholars, many mathematicians are likely to have studied the earli- 
est and most famous Sanskrit grammar: that of Panini of the fifth/fourth century 
BCE, or its later adaptations such as the Kasika of the seventh century CE or the 
Paniniyan grammar of Bhattoji Diksita of the seventeenth. Buddhists had their
own grammars due to famous masters such as Candragomin of the fifth century 
CE, just as the Jainas had great grammarians from Devanandin (fifth c. CE) to 
Hemachandra (twelfth) and beyond (for more information on the Sanskrit gram- 
marians see Staal 1972). All these works were inspired, directly or indirectly, by 
the Paninian tradition. And all of them possessed not one but many zeroes. 

What is zero in grammar or linguistics? Panini had a technical term for it: 
lopa. He defined it as “something that does not appear" (adarsanam lopah 1.1.60). 
It is not a rare term in his grammar. Its “non-appearance” (adarsanam) does not
prevent it from occurring in forty-five out of four thousand rules if I counted them
correctly as they were enumerated by Böhtlingk in his edition (1887, with many
reprints: II: 271*). The actual number is higher since I have not taken account of
Böhtlingk’s uses of the expression fgg. which indicates “and following".

Professor P.P. Divakaran, who commented on an earlier draft of this article, 
was intrigued by the definition of lopa as adarsanam because Panini certainly 
lived before Aśoka who presumably introduced writing in the third century BCE
and: “I should think that a sound which is absent would be characterised by Panini 
as unheard or unsounded rather than as unseen". 



* The philologist’s concept of lectio difficilior, “the more difficult reading,” should be extended so
as to be applicable to the oral transmission of compositions: the more difficult and unexpected of two
readings or orally transmitted forms, viz., Aryabhata, must be preferred to the normal, expected form
Aryabhatta, since it is unlikely that it is based upon an error.

This is an apt observation but there is more to say. Some interpreters have indeed
translated Panini’s adarsanam as ‘unseen’ since the verbal root drs- certainly
means “to see". I translated it differently as “something that does not appear" like 
others have done because that same root is widely used in the much wider mean- 
ing of seeing with one's mind. It includes perception, observation, appearance, 
knowing, etc., and is a common meaning in Sanskrit and similarly in other lan- 
guages. In Indian philosophy, the six traditional systems are called darsana but 
their epistemology is not confined to seeing with one's eyes. In English we say: 
"I see what you mean". In later Sanskrit, na drsyate means: “it does not appear 
(that such-and-such is the case)". In English the same ambiguity applies to appear 
itself: "there appeared a large bird on the roof" suggests that the bird was actually 
seen but in "there appears to be much confusion about the PM of Thailand stepping
down", nothing may have been perceived by eyes or eyes only; it is a topic
that people are talking about. 

According to Renou's dictionary of grammatical terms in Sanskrit (Renou 1957 
under adarsana, lopa and lup-), adarsana means amuïssement, a technical term
in French which expresses that a phoneme is dropped in pronunciation. Renou
refers to Katyayana, a grammarian who commented on Panini and lived some two
or three centuries later, probably during the reign of Aśoka (a fuller but somewhat
opaque discussion occurs in Cardona: 1976, 1980, pages 267-273). That
date could support an interpretation that refers to writing, but Katyayana himself
does not see it that way. Whether it was written or not, his statement is startling:
adarsana means not seen, not heard, not pronounced, not perceived, absence or 
disappearance. It calls for a closer analysis and the entire subject seems to stand 
in need of a new and thorough revision but I think that, in the present context, 
we may safely conclude that drs- does not only mean “to see with one's eyes", 
that adarsanam does not only mean “unseen” and that “something that does not 
appear” is the best translation—for the time being. 

Mathematicians studying Sanskrit in order to be able to compose works with 
all-India appeal, could not have missed the numerous zeroes that occur in Sanskrit 
grammars. Modern readers are able to understand them provided they know something
of the morphology of words in Sanskrit. It is found also in other Indo-European
languages such as English, and in other languages as well. The examples that follow below under
(1)-(3) occurred in three papers by Pandit: 1962, undated and 1990 (not seen), all 
of which, like Allen 1955, suffer from the complex constructs of outdated linguis- 
tic systems. No such defects disfigure Shefts 1961 who treated (2) and (3) and was 
reviewed in Staal 1963a (reprinted in 1988: 232-237). 

Before we begin I must mention that Sanskrit, unlike English, does not require an
explicit pronoun. The verbal form khadati does not mean “eats” but “he eats”. That
“he” disappears when there is a subject as in ramo khadati which means “Rāma
eats” and not *“Rāma he eats” (where the asterisk expresses ungrammaticality). What
is important in our context is that Sanskrit distinguishes like English between the
stem and the suffix or ending of a verb. From the stem khad- (“eat-”) and the
ending -ti (“-s”) it forms:


khad-a-ti    “(he) eat-s”    (1)


What is -a- ? It is something in between which we may call an infix. I put hyphens 
between the three elements in the Sanskrit expression to distinguish the stem and
the ending, which we find in both Sanskrit and English, and to highlight the infix,
which we find only in Sanskrit.

There are, however, various classes of verbs in Sanskrit. One of them has a 
verb with the same meaning but lacks the infix: 


ad-ti    “(he) eat-s”    (2)


which becomes atti through what is sometimes called “assimilation”.
Panini’s grammar is a list of rules (sūtra). (1) follows from a general rule. (2)
seems to illustrate a special case. However, (1) and (2) express similar properties
and the underlying rule is the same if we adopt a principle called laghava, literally
“lightness”. (2) is now expressed as:


ad-0-ti    “(he) eat-s”    (3)


The symbol “0” which indicates absence of the infix is the lopa defined as “something
that does not appear”. Here we meet the grammatical or linguistic zero. It
occurs not only in verbs but also in nouns and it should be obvious that there are 
many of them. 
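
The point of examples (1)-(3) can be sketched in a few lines: once a zero infix is allowed, a single rule of the shape stem + infix + ending covers both verb classes. This is a toy illustration only. The names `VERB_INFIX`, `analyse` and `surface` are hypothetical, the two-entry table stands in for whole verb classes, and the later assimilation of the surface form is not modelled; none of it is meant to reproduce Panini's actual rules.

```python
# The zero infix: present in the analysis, absent from the spoken form.
ZERO = ""

# Hypothetical mini-lexicon: each stem is paired with the infix of its class.
VERB_INFIX = {
    "khad": "a",   # class that shows the infix -a-, as in (1)
    "ad": ZERO,    # class whose infix is zero, as in (3)
}

def analyse(stem, ending="ti"):
    """Return the three-part analysis, writing the zero infix as 0."""
    infix = VERB_INFIX[stem]
    shown = infix if infix else "0"
    return f"{stem}-{shown}-{ending}"

def surface(stem, ending="ti"):
    """Return the string actually produced; the zero contributes nothing."""
    return stem + VERB_INFIX[stem] + ending

for stem in VERB_INFIX:
    print(analyse(stem), "->", surface(stem))
```

Both stems pass through the same two functions; the difference between them lives entirely in the table, which is the sense in which the zero makes the special case disappear.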

Panini's rules are generally ordered and he could have reversed the order, i.e., 
start with (2) and derive (1) by inserting the infix. Such problems are discussed 
by commentators and other grammarians, including Joshi and Kiparsky 1979 and 
Kiparsky 1991 who have shown that “lightness” is not simply an abbreviation but 
expresses generalization. It explains the famous saying: “grammarians rejoice 
over the saving of half a syllable as over the birth of a son". My examples do no 
more than illustrate the thesis, that the many linguistic zeroes of Sanskrit grammar 
led mathematicians to muse about one or more mathematical zeroes. 

Indic mathematicians did not only study Panini to compose works with all- 
India appeal. They were a small group of people, not popular or prestigious like 
mathematicians in the modern world. In India there was only one "science of 
the sciences" (sastranam sastram), the boundless (anantaparam), the summit
of learning: grammar or vyakarana which literally means "analysis". Mathematicians
were flattered to be associated with such a prestigious tradition. In the
modern world, the opposite holds. Grammar is not a popular subject and many
scientists do not even know what "linguistics" means. It developed as a serious 
discipline only after the discovery of Sanskrit by William Jones, Charles Wilkins' 
Sanskrit grammar and Franz Bopp's adoption of the methods of the Indian grammarians
in his "Systems of Conjugation in Sanskrit compared with those in the
Greek, Latin, Persian and Germanic Languages" of 1816 (see Thieme 1982/83,
Staal 1989 and 1993, Chapters 4 and 5, and 1995: 109). 

The histories of linguistics, logic and the theory of ritual in India and Europe 
were first compared in Staal 1988: 36-45. It presented graphs of developments 
that gave a rough idea of the ups and downs. Logic included mathematical logic, 
but I did not then, and would not now be able to include mathematics. "The 
Theory of Definition in Indian Logic" (1961, reprinted in 1988:90) referred to 
the occurrence of a kind of null-class presupposed in “modern” works such as 
the Tarka-samgraha of the seventeenth century AD. But in India, linguistics and 
logic were closely connected almost from the beginning. 

Panini distinguished different zeroes from each other by making use of a rich 
conceptual apparatus. He was aware of the fact that the language of his grammar 
was modeled in part after the language that was the object of his study: Sanskrit. 
It could lead to confusion unless the two were clearly distinguished. He therefore 
made a distinction between “rules” (sutra) and “metarules” (paribhasa). Implicitly
working with what we would now call a metalanguage, Panini made explicit
use of meta-linguistic markers which he called it. To distinguish the different zeroes
from each other, he made use of the fact that lopa comes from a verbal root
that starts with "l" and to which we shall return. The meta-linguistic markers always
have an "l" that marks them as dealing with zeroes such as luk, lat, lit and
slu, each defined for particular classes or special cases. In modern transliterations,
they are indicated by capitals (which Sanskrit has no means of distinguishing from
small letters). The uses of metalanguage in Sanskrit grammar have been studied
by Scharfe 1961 and Staal (1963b, 2003: 353-6); rules and metarules are studied
separately in Staal forthcoming. 

English grammar does not use such meta-linguistic markers but it could do 
something similar. It may be illustrated with the help of a rough sketch of English 
noun pluralization (a formalized grammar of such a topic may look quite different 
and require a substantial book). We shall begin with a general rule, where P is the 
plural marker: 


noun + P > noun + suffix (e)s 


This is a context-sensitive rule in which > stands for “is replaced by”; + stands 
for concatenation; and parentheses express options that distinguish does from 
witches. The general rule as stated does not account for fish or sheep which require 
a zero-suffix. 

My account, so far, applies to written English. It does not ex[...]
pronunciations of the written s, which may sound like “s” or [...] account for
pluralization in both written and spoken English [...] variety of expressions.
I shall not belabor the point, but Panini's way with meta-linguistic
zero-suffixes for special cases may be illustrated again for English by
using subscripts as linguistic markers, for example:


    man + 0₁ > men
    woman + 0₂ > women
    mouse + 0₃ > mice.
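
The rough sketch above can be mimicked in a few lines of Python. This is only an illustration under stated assumptions: the table `ZERO_SUFFIX`, the marker names `0`, `0_1`, `0_2`, `0_3` and the function `pluralize` are hypothetical, and the choice of endings that take "es" is a simplification, not a grammar of English.

```python
# Hypothetical table of zero suffixes: a plain zero leaves the noun unchanged,
# an indexed zero (0_1, 0_2, 0_3) triggers the stem change of one special case.
ZERO_SUFFIX = {
    "fish": ("0", lambda noun: noun),
    "sheep": ("0", lambda noun: noun),
    "man": ("0_1", lambda noun: noun.replace("man", "men")),
    "woman": ("0_2", lambda noun: noun.replace("man", "men")),
    "mouse": ("0_3", lambda noun: noun.replace("ous", "ic")),
}


def pluralize(noun):
    """Return (marker, plural) for a written English noun.

    Special cases are handled first by their zero suffix; every other noun
    falls under the general rule noun + P > noun + (e)s.
    """
    if noun in ZERO_SUFFIX:
        marker, rewrite = ZERO_SUFFIX[noun]
        return marker, rewrite(noun)
    # Simplified assumption: "es" after sibilant spellings, "s" otherwise.
    suffix = "es" if noun.endswith(("s", "x", "z", "ch", "sh")) else "s"
    return suffix, noun + suffix
```

The special cases are consulted before the general rule, which mirrors the way a marked exception takes precedence over the rule it restricts.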


The spelling of English is idiosyncratic but that of Sanskrit, in that respect
closer to Italian or Spanish, is rational. It is adopted by all Indic syllabaries in
South and Southeast Asia and in the Roman transliteration adopted by Sanskrit
scholars worldwide. That transliteration writes the ou of English mouse as au and
the i of mice as ai. They are part of an extended system with similar sound
correspondences in Sanskrit and Indo-European. Sanskrit derives from nouns such as
Siva the adjective Saiva which, in English, became "Shivaite". Similarly, the noun
rudra produces the adjective raudra to which no English adjective corresponds.


4. Zeroes in the Srauta Ritual 


I shall end our discussion with the Vedic Srauta ritual which belongs to the pre- 
history of zero as well as that of Sanskrit Grammar. Vedic ritual is, therefore, a 
parent as well as a grandparent of zero. All forms of Vedic ritual are concerned 
with recitations, chants and acts. Recitations come from the Rigveda, chants from 
the Samaveda and ritual acts are the chief concern of the Yajurveda. Vedic ritual
[...] a science or theory (Staal, forthcoming) in one of its later forms which
developed between roughly the tenth and seventh centuries BCE and became known
[...] did not take place among other higher animals [...] theories because they do
not possess language [...] about ritual with each other, introduce [...] started
thinking ritually; but it did not [...] been the first. Japanese theologians
[...] 2003). Modern students have [...] psycholo[...]

Sanskrit grammar deals with Sanskrit and influenced modern linguistics. The
Srauta ritual did not influence any modern theory of ritual because there is no 
such theory. It implies that we cannot adopt Needham's rule as formulated on page 
3 above: we cannot do “the only thing we can do” because there is no yardstick. 
The reader will accordingly meet with unfamiliar concepts, methods and modes of 
analysis. Some are discussed in Chapters 7, 12 and 13 with their Source Notes in 
Staal 2008. But unless we are acquainted with one of the few surviving traditions 
of Srauta ritual and/or some of the literature in Sanskrit and in modern languages 
about Vedic ritual, we shall be on our own. 

Basic to any ritual performance are space and the four directions. The rit- 
ual arena consists of several sheds with thatched roofs that are temporarily con- 
structed for each performance. Some of the most important Srauta ceremonies 
are performed in a small space at the center that is called the Sadas. The word 
is derived from the Sanskrit verb sad- or "sit" which occurs in the contemporary 
term upanisad, "sitting close to" (the teacher). 

The Sadas looks as follows with the north on top: 


RIGVEDA > |< YAJURVEDA 


Vedas are recited in the four directions which the reciters themselves must also 
face. The above sketch makes use of two directions that are indicated in the figure 
by symbols we have used before but that now have a new meaning: > means 
“facing east” and < means “facing west”. These directions raise a host of technical 
and theoretical problems, some of them discussed by Caland and Henry (1906-7: 
232) and Keith (1914: I, 252 note 4). I shall mention two. The first is concerned 
with the directions only. The second combines directions with the verbal root from 
which lopa derives. 

The first case is illustrated by the sketch. It depicts a change of directions that 
has just taken place. It does not involve the Rigveda which is recited inside the 
Sadas by an officiant who is already sitting there, facing and reciting east. I shall 
call him R. The Yajurveda is recited by a priest I shall call Y, but he has come from 
outside the Sadas and cannot easily enter. He has made several turns already and 
will eventually face west and face R: the auspicious result that is depicted here. 
Earlier, the two officiants did not only fail to face each other but Y sat with his back 
to R - a situation that is to be avoided at all costs. The entire episode illustrates 
how the rivalry between different factions may be resolved and overcome. 

The stage is now set for the second case. A dialogue unfolds which is initiated
by R who recites a proposal: 


“Let us both recite!” (Sornsavo) 


Y responds from outside the Sadas with a touch of flattery. His verse consists 
of two halves: 


“Let us both recite, divine one!" (Sornsavo daiva) 


“Recite! Let us both rejoice!” (samsa madeva) 


Suppose Y were to omit the second half-verse. I have never witnessed it but it 
must have happened often because the two halves are very similar. The problem 
is addressed by the Yajurveda: “if the response after the half-verse were omitted 
(lupyeta) it would be like someone being left behind by others who are running 
ahead" (Taittiriya Samhita 3.2.9.5). Here we have an instance of the verbal root
lup- from which lopa is derived. The general meaning of the verb is "disappear" 
or "get lost". 

Louis Renou, whom I mentioned before, was the first to draw attention to the 
numerous ritual uses of lup- in his 1941-1942 study on the connections between 
Sanskrit grammar and Vedic ritual (465, note 83). It is also Renou, “the most com- 
plete Sanskritist" as he was called by V. Raghavan (1956: 20), who first demon- 
strated the historical precedence of ritual over grammar. 

I conclude that ancient India reverberated with zeroes, zero entities and zero 
events long before the geometry of the Sulva Sütras which are post-Buddhist. 
What may be called the prehistory of zero was expressed in early Vedic by kha 
which refers to cavities of various sorts and occurs in the Upanisads in the sense 
of "space". The Srauta Sütras, late Vedic but pre-Buddhist, used lopa to refer to 
omissions, disappearances and things that are lost. It is here that the origins of the 
mathematical concept of zero seem to lie. We do not know where it happened if it 
happened only once, but the most likely place would be the Kuru region north of
modern Delhi, though it may have been further east in Magadha, which overlaps
with modern Bihar. The time must have been after 1,000 and before 600 BCE when
the creative period of the Śrauta ritual was over. It is a long period with smudgy
edges but there it is.

Reverting finally to arithmetic, there are important questions that I have not so
far considered. How was zero conceived as a member of the number series? Takao
Hayashi has suggested that it may be related to additions such as 15 + 20 = 35 (in
modern symbols) which presuppose 5 + 0 = 5: plausible enough, but when did it
happen? Or could zero be related to the recursive principle underlying the decimal
number names (and hence their construction) which, according to P.P. Divakaran,
was perfectly well understood in the early Rigveda? Plausible also, but would it
imply that zero was conceived as the beginning of the infinite series of natural
numbers, so that one would count 0, 1, 2, 3 etc.?

Let us return once more to our explanation of the origin of the mathematical
zero in terms of the assignment of special meanings to Sanskrit terms such as lopa,
Sunya or bindu. We are fortunate to possess the records of such events. Similar
words occur in other human languages but did not undergo a similar development
as far as I know. In their Indic evolution, lopa was inspired by ritual but is that a
necessary part of its prehistory? All we know is that it added the flesh of another 
empirical discipline to the bones of linguistics, the discipline that underlies our 
understanding of the development of language. As far as I can tell, thus far, two 
conditions must be satisfied before a concept of zero may arise: there needs to be 
a language as well as another formal structure in which that language is used to 
signify that something has disappeared or is lost. 


Combinatorial Methods
in Indian Music: Pratyayas 
in Sangitaratnakara of Sarngadeva 


Raja Sridharan, R. Sridharan and M. D. Srinivas 


1. Introduction 


Six combinatorial tools (called pratyayas) have been in systematic use in India for
the study of Sanskrit prosody (Chandas-sastra) and these go back in time at least
to Pingala (c. 300 BC?). Three of these are found in Bharata's Natyasastra, in the
chapter where prosody is discussed: prastara (an enumeration rule for generating
all the possible metrical patterns of a given class as a sequence of rows),
uddista (the process for finding, for any given metrical pattern, the corresponding
row number in the prastara) and nasta (the converse of uddista). Incidentally,
the problem of placing Pingala and Bharata chronologically remains unsettled.
The notion of pratyayas was perhaps discussed in
other ancient texts of music also. However, the first extant text on music where the
pratyayas are systematically dealt with, both in connection with patterns of musical
phrases (tanas) and patterns of musical rhythms (tālas), is Sangitaratnakara
of Sarngadeva (c. 1225 AD). Narayana Pandita in his Ganitakaumudi (1356
AD) deals with some of these questions in a more general context, though his
theory does not cover the kind of tala-prastara considered by Sarngadeva.

Our aim in this article is to highlight the contributions of Sarngadeva and explain
his work in a mathematical setting. We first discuss the sequential generation
or enumeration of patterns of musical phrases, called tana-prastara. The method
of generating these patterns, as discussed in the first chapter of Sangitaratnakara,
is essentially a rule for generating sequentially the n! permutations of n symbols.
We note that the prastara, and the nasta and uddista processes are all indeed
encoded in a certain unique representation of any integer in terms of sums of
factorials. We also explain how Sarngadeva employs a tabular figure, khanda-meru,
essentially to go back and forth between any integer and its representation as a
sum of factorials.

We then move on to discuss the pratyayas for patterns of musical rhythms,
tala-prastara. This theory has been dealt with at length in the fifth chapter of
Sangitaratnakara. It is in fact a generalisation of the theory of pratyayas for
moric metres or matra-vrttas, where the short syllable (laghu) is taken to be of
one matra (metrical time unit) and the long syllable (guru) is taken to be of two
matras. Sangitaratnakara considers musical rhythmic patterns (tālas) made up
of druta (of one time unit), laghu, guru and pluta, which last 2, 4 and 6 of
those druta units respectively.

It should be noted that the notion of pluta, viewed as a prolated vowel, considered
to be of the same duration as three laghus, appears already in ancient Sanskrit
prosody. In fact the notion of pluta occurs in the Rgveda only three times, though 
much more frequently in other Samhitas and Brahmanas. In classical Sanskrit, 
a pure consonant sound called a vyanjana (without any vowel attached) is said to 
be of half a matra. A laghu is said to be of one matra, guru of two matras and 
a pluta of three matras respectively. Clearly, as it is clumsy to handle fractions in 
music, time units of 1, 2, 4 and 6 were perhaps introduced and called druta, laghu, 
guru and pluta respectively. 

Sangitaratnakara first presents a systematic method of enumerating all the
tālas of a given time duration in a prastara, and follows this up with a complete
mathematical theory of pratyayas which is a generalisation of the corresponding
theory for moric metres. An interesting feature of tala-prastara is that the total
number of patterns (the samkhyanka), if laid out in a sequence, has a generating
function which involves a polynomial of the sixth degree. This is an analogue of
the notion of samkhyas for the moric metres, which are given by the so-called
Fibonacci numbers (a result which is already present in the seventh century Prakrta
text on prosody, Vrttajatisamuccaya of Virahanka), which have as their generating
function the inverse of the quadratic polynomial $(1 - x - x^2)$. In fact the
sequence of samkhyankas associated with tala-prastaras satisfies a more complex
(four term) recurrence relation and has the inverse of the sixth degree polynomial
$(1 - x - x^2 - x^4 - x^6)$ as its generating function. Based on this generating
function, we can compute the number of tala patterns in the prastara, which have
a given number of drutas, laghus, gurus or plutas. This is an analogue of the
pratyaya known as ekadvayadi-lagakriya in Sanskrit prosody, and is discussed
in Sangitaratnakara for the tala-prastara by introducing various tabular figures
called merus, which are constructed on the basis of systematic recurrence relations
which can be derived from the generating function mentioned above.
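
As a small check on the generating function quoted above, the following Python sketch computes the samkhyankas from the four-term recurrence S(n) = S(n-1) + S(n-2) + S(n-4) + S(n-6) and compares them with a brute-force enumeration of all tala patterns. The function names and the convention S(0) = 1 (the empty pattern) are assumptions made here for illustration; they are not taken from the text.

```python
# Durations of druta, laghu, guru and pluta, measured in druta units.
PARTS = (1, 2, 4, 6)


def samkhyanka(n):
    """Number of tala patterns of total duration n (in druta units).

    Uses S(k) = S(k-1) + S(k-2) + S(k-4) + S(k-6), which is what the
    generating function 1 / (1 - x - x^2 - x^4 - x^6) encodes.
    S(0) = 1 is an assumed convention counting the empty pattern.
    """
    s = [1]
    for k in range(1, n + 1):
        s.append(sum(s[k - p] for p in PARTS if k - p >= 0))
    return s[n]


def enumerate_talas(n):
    """All sequences of durations from PARTS that add up to n."""
    if n == 0:
        return [()]
    return [(p,) + rest
            for p in PARTS if p <= n
            for rest in enumerate_talas(n - p)]
```

For durations 1 to 8 the recurrence gives 1, 2, 3, 6, 10, 19, 33, 60, and the enumeration produces lists of exactly those lengths.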

The discussion of pratyayas in prosody and music leads to the study of
combinatorics related to three important ways of representing any non-negative
integer, representations which are widely in use even today.¹ While discussing the
pratyayas for varna-vrttas, Pingala gave the procedure for finding the binary
representation of integers. Much later, Narayana Pandita, in his Ganitakaumudi

¹ See for instance, A. S. Fraenkel, Systems of numeration, Amer. Math. Monthly 92 (1985),
105-114.
(c. 1356 AD), generalised the method of Pingala for an arbitrary radix. The
discussion of pratyayas for matra-vrttas led to the discovery of the so-called
Fibonacci representation of integers which expresses any integer uniquely as a
suitable sum of Fibonacci numbers. In his discussion of tala-prastara, Sarngadeva
introduced a generalization of the Fibonacci representation, where the Fibonacci
numbers are replaced by the samkhyankas of tala-prastaras. Another form of
generalization was considered later by Narayana Pandita. As we already noted,
Sarngadeva, in his construction of khanda-meru, implicitly employed the so-called
factorial representation of integers in his discussion of the pratyayas for
tana-prastara.
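
To make the idea of a Fibonacci representation concrete, here is a minimal Python sketch of the standard greedy form, in which a positive integer is written as a sum of distinct, non-consecutive terms of the sequence 1, 2, 3, 5, 8, ... (the samkhyas of the moric metres). This is an assumption about the form intended: the representation that actually arises from the uddista and nasta rules of the prosody texts may be set up differently, and the function name is hypothetical.

```python
def fibonacci_representation(m):
    """Greedy representation of m >= 1 as a sum of terms of 1, 2, 3, 5, 8, ...

    Returns the terms in decreasing order. Taking the largest possible term
    at each step guarantees that no two chosen terms are neighbours in the
    sequence.
    """
    if m < 1:
        raise ValueError("m must be a positive integer")
    fibs = [1, 2]
    while fibs[-1] + fibs[-2] <= m:
        fibs.append(fibs[-1] + fibs[-2])
    terms = []
    for f in reversed(fibs):
        if f <= m:
            terms.append(f)
            m -= f
    return terms
```

For example, 10 is represented as 8 + 2 and 100 as 89 + 8 + 3.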


2. Tana-Prastara 


Sangitaratnakara of Sarngadeva (c.1225) is one of the seminal texts of Indian 
music. It acts as a bridge between the ancient texts like Natya-sastra, Brhaddesi 
and Dattilam and all the major later texts. As Sarngadeva states, his family
originally came from Kashmir and his grandfather, father and he himself received 
patronage from king Singhana (1210-1247) of the Yadava dynasty ruling from 
Devagiri in Maharashtra. 

Chapter I of Sangitaratnakara is the Svaragatadhyaya. In this chapter, the
fourth section is entitled Grama-murcchana-krama-tana-prakarana. Of the 91
verses of this section, verses 61-70 deal with the pratyayas associated with
tana-prastara. In the following discussion, we give the textual verses,² which discuss
each of the pratyayas, along with a brief paraphrase³ and necessary mathematical
explanation.

A tana is a sequence of the seven svaras (basic musical tones or notes) which
we denote in their natural order by S, R, G, M, P, D, N. In this section, Sarngadeva
considers the problem of enumeration of all the possible tanas where each
svara appears only once.⁴ For instance, if we consider the four svaras S, R, G, M,


² We have used the following edition of the work: Sangitaratnakara of Sarngadeva, edited with
Kalanidhi of Kallinatha and Sangitasudhakara of Simhabhupala, by S. Subramanya Sastri,
2nd Ed. revised by S. Sarada, Adyar Library, Madras: Vol. I, 1992; Vol. III, 1986. The excellent
commentaries of Simhabhupala (c. 1350) and Kallinatha (c. 1420) present lucid explanations of
the text along with examples.

³ English translations of the verses on Tanaprastara, in Chapter I (Svaragatadhyaya) of the text,
are available in: Sangitaratnakara of Sarngadeva, edited with English translation by R. K. Shringy
and Prem Lata Sharma, Vol. I, Motilal Banarsidass, Varanasi 1978. English translations of the verses on
Talaprastara, in Chapter V (Taladhyaya) of the text, are available in: Akella Mallikarjuna Sarma,
Tala Prastara of Nissanka Sarngadeva's Sangitaratnakara, Sannidhi Sangita Publications,
Hyderabad 2001.

⁴ Narayana Pandita in his Ganitakaumudi has generalised the theory of tanaprastara to
include tanas which involve repetition of svaras also.
there are 24 tanas, which represent the various possible permutations of them.
Sarngadeva gives a systematic method of enumerating these permutations in a
sequence. Before discussing his procedure for enumeration of the tanas, Sarngadeva
first states the total number of tanas. This is the first pratyaya, known
as the samkhya, which is to count the number of tanas with a given number of
svaras:


[Sanskrit verses in Devanagari, not legible in this copy (Sangitaratnakara, Chapter I, section 4)]


The above verses state that the total number of tanas with r svaras is given by
the product $1 \cdot 2 \cdots r$, for $r = 1, 2, \ldots, 7$. This is nothing but the
statement that the total number of permutations that can be formed from r distinct
svaras is $r!$.




2.1. The rule for the construction of the Prastara 


[Sanskrit verses in Devanagari, not legible in this copy (Sangitaratnakara, Chapter I, section 4)]


[...] distinct svaras can be enumerated in the form of an array in the [...]
the first row the svaras are written in the ascending order. [...]
the prastara are [...] by the following rule: In each row [...]
order. This completes the [...] till the [...] is reached where all the svaras
[...] consider the case of four [...]

Here there are actually two svaras, R, S, which are lower and are to the left of G,
and the highest of them R is placed below G. The svara to the right M is brought
down. The remaining svaras, S and G, are placed in the ascending order to the
left, thus completing the third row as SGRM. And so on. The entire prastara is
shown in Table 1.
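
The step from one row to the next, as illustrated by the worked case above, can be sketched in Python. Since the statement of the rule itself is damaged in this copy, the code follows a reading reconstructed from that worked case, and this reading is an assumption: scan the row from the left for the first svara that has a lower svara somewhere to its left, place under it the highest of those lower svaras, bring down everything to its right unchanged, and fill the positions to its left with the remaining svaras in ascending order. The function names are hypothetical.

```python
def next_row(row, order):
    """Return the row following `row` in the prastara, or None after the last row.

    `order` lists the svaras in their natural ascending order, e.g. "SRGM".
    """
    rank = {s: i for i, s in enumerate(order)}
    for i in range(1, len(row)):
        # Svaras to the left of position i that are lower than row[i].
        lower = [s for s in row[:i] if rank[s] < rank[row[i]]]
        if lower:
            placed = max(lower, key=rank.get)      # highest of the lower svaras
            rest = sorted((s for s in row[:i + 1] if s != placed), key=rank.get)
            # Remaining svaras ascending on the left, then the placed svara,
            # then the svaras to the right brought down unchanged.
            return rest + [placed] + list(row[i + 1:])
    return None  # the row is fully descending: the prastara is complete


def prastara(order):
    """All rows of the prastara, starting from the ascending row."""
    rows, row = [], list(order)
    while row is not None:
        rows.append("".join(row))
        row = next_row(row, order)
    return rows
```

Calling `prastara("SRGM")` yields 24 rows beginning SRGM, RSGM, SGRM and ending with MGRS, in agreement with the rows of Table 1 cited in the text.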

Sarngadeva's rule for the construction of the prastara can be presented in a
general context by considering the enumeration of the permutations of n distinct
elements with a natural order, which we may assume to be $a_1, a_2, a_3, \ldots, a_n$. The
prastara of these is an $n! \times n$ array of n symbols in n! rows, which will be denoted
by $[a_1, a_2, a_3, \ldots, a_n]$. We will also denote by $[a_i]_r$ the column of r entries,
all equal to $a_i$.


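 Row | Tana    | General form
   1 | S R G M | a1 a2 a3 a4
   2 | R S G M | a2 a1 a3 a4
   3 | S G R M | a1 a3 a2 a4
   4 | G S R M | a3 a1 a2 a4
   5 | R G S M | a2 a3 a1 a4
   6 | G R S M | a3 a2 a1 a4
   7 | S R M G | a1 a2 a4 a3
   8 | R S M G | a2 a1 a4 a3
   9 | S M R G | a1 a4 a2 a3
  10 | M S R G | a4 a1 a2 a3
  11 | R M S G | a2 a4 a1 a3
  12 | M R S G | a4 a2 a1 a3
  13 | S G M R | a1 a3 a4 a2
  14 | G S M R | a3 a1 a4 a2
  15 | S M G R | a1 a4 a3 a2
  16 | M S G R | a4 a1 a3 a2
  17 | G M S R | a3 a4 a1 a2
  18 | M G S R | a4 a3 a1 a2
  19 | R G M S | a2 a3 a4 a1
  20 | G R M S | a3 a2 a4 a1
  21 | R M G S | a2 a4 a3 a1
  22 | M R G S | a4 a2 a3 a1
  23 | G M R S | a3 a4 a2 a1
  24 | M G R S | a4 a3 a2 a1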

TABLE 1. Tana-prastara of SRGM. 


We explain the rule of enumeration by an inductive procedure. To start with,
the prastara $[a_1]$ of one element is simply the $1 \times 1$ column $[a_1]_1$. The prastara
$[a_1, a_2]$ of two elements is the $2 \times 2$ array

$$\begin{matrix} a_1 & a_2 \\ a_2 & a_1 \end{matrix}$$

which can also be written as

$$\begin{matrix} [a_1]\,[a_2]_1 \\ [a_2]\,[a_1]_1 \end{matrix}$$

The prastara $[a_1, a_2, a_3]$ of three elements is a $6 \times 3$ array, which (in our
notation) has the form

$$\begin{matrix} [a_1, a_2]\,[a_3]_2 \\ [a_1, a_3]\,[a_2]_2 \\ [a_2, a_3]\,[a_1]_2 \end{matrix}$$

Having obtained the prastara of three elements $[a_1, a_2, a_3]$, we can see that the
prastara of four elements $[a_1, a_2, a_3, a_4]$, explicitly shown in the preceding
Table 1, has the structure

$$\begin{matrix} [a_1, a_2, a_3]\,[a_4]_6 \\ [a_1, a_2, a_4]\,[a_3]_6 \\ [a_1, a_3, a_4]\,[a_2]_6 \\ [a_2, a_3, a_4]\,[a_1]_6 \end{matrix}$$

[...] at the left. This is precisely the permutation $(a_1, a_2, \ldots, a_{n-2}, a_n, a_{n-1})$ which
appears in the above general form at the row numbered $(n-1)! + 1$. By induction,
it can be seen that the array given above is indeed the prastara of n elements
according to the construction of Sarngadeva.

We may also note that the above prastara enumerates all the permutations
of n distinct elements in an order which is different from the so-called
lexicographic order, which is perhaps more commonly used in modern combinatorics.
Sarngadeva's rule actually enumerates the permutations in what is sometimes
referred to as the colex order: the mirror image of lexicographic order in the
reverse.
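
The block structure described above, and the remark about the ordering, can both be checked with a short Python sketch. It builds the prastara of n elements recursively, one block for each choice of last element taken from the highest to the lowest, and compares the result with the list of all permutations sorted so that the rows, read from right to left, are in descending lexicographic order. That description of the order is our reading of "mirror image of lexicographic order in the reverse" and should be taken as an assumption; elements are represented by their indices 1, ..., n, and the function names are hypothetical.

```python
from itertools import permutations


def prastara(n):
    """Rows of the prastara of a_1, ..., a_n as tuples of indices."""
    def build(elems):
        if len(elems) == 1:
            return [tuple(elems)]
        rows = []
        # One block per last element, from the highest down to the lowest;
        # each block is the prastara of the other elements with that
        # element appended as a constant last column.
        for last in reversed(elems):
            rest = [e for e in elems if e != last]
            rows.extend(r + (last,) for r in build(rest))
        return rows
    return build(list(range(1, n + 1)))


def mirrored_descending(n):
    """All permutations, ordered so that their mirror images descend lexicographically."""
    return sorted(permutations(range(1, n + 1)),
                  key=lambda p: p[::-1], reverse=True)
```

For n up to 6 the two lists coincide, and for n = 4 the row numbered 3! + 1 = 7 is (1, 2, 4, 3), as stated in the text.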


2.2. A “factorial representation" of integers 


After discussing the prastara or the enumeration process, Sarngadeva discusses 
the pratyayas, uddista and nasta. Uddista is the process by which, given a tana 
or permutation of the svaras, one finds the number of the row in which it appears 
in the prastara (or enumeration), without going through the process of generating 
the entire prastara. Nasta is the converse process by which, given the number of 
the row, the corresponding tana or the permutation is determined. The processes 
of uddista and nasta are discussed by Sarngadeva in terms of what is called 
the khanda-meru, and are essentially based upon a certain representation of any 
integer uniquely as a sum of factorials. 

In this section we shall outline some mathematical aspects of this representa- 
tion, which is a variant of what is widely known as the factorial representation of 
integers. We shall then consider how Sarngadeva’s construction of prastara and 
the uddista and nasta processes are encoded in this representation. We begin with 
the following: 

Proposition 1: Every integer $1 \le m \le n!$ can be uniquely represented in the
form

$$m = d_0\, 0! + d_1\, 1! + d_2\, 2! + \cdots + d_{n-1}(n-1)!, \qquad (1)$$

where $d_i$ are integers such that $d_0 = 1$ and $0 \le d_i \le i$, for $i = 1, 2, \ldots, n-1$.

Proof: We prove the existence of such a representation by actually giving an
explicit procedure by which every integer $1 \le m \le n!$ can be represented in the
form (1) given above. First, $d_{n-1}$ is obtained as the quotient in the division of m
by $(n-1)!$, where we stipulate further that, if m is divisible by $(n-1)!$, then the
quotient is reduced by 1 and $(n-1)!$ is taken as the remainder. That is, $d_{n-1}$ and
$r_{n-1}$ are given uniquely by

$$m = d_{n-1}(n-1)! + r_{n-1}, \qquad (2)$$

where $0 \le d_{n-1} \le n-1$ and $0 < r_{n-1} \le (n-1)!$. Next, we find $d_{n-2}$ and
$r_{n-2}$ similarly by dividing $r_{n-1}$ by $(n-2)!$. And so on. When we finally reach $r_2$,
since $0 < r_2 \le 2$, we set $d_1 = 1$ if $r_2 = 2$ and $d_1 = 0$ if $r_2 = 1$. Then, it follows
that $d_0 = 1$, and we have thus a systematic procedure by which every integer
$1 \le m \le n!$ can be represented in the form (1) given above. In particular, for any
$r \le n$ we have a fascinating relation (with $d_0 = 1$, $d_i = i$, for all $1 \le i \le r-1$),

$$r! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (r-1)(r-1)!. \qquad (3)$$
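
The uniqueness of the representation (1) can be shown by induction on m. We
note that since the uniqueness is clear for n = 1, 2, we may certainly start the
induction. Assume that m > 2 and let

$$m = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-1}(n-1)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-1}(n-1)!.$$

If $d_{n-1} = d'_{n-1}$, then

$$m - d_{n-1}(n-1)! = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-2}(n-2)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-2}(n-2)!,$$

so that (by induction on m) it follows that $d_i = d'_i$ for $1 \le i \le n-2$ also. If
$d_{n-1} \ne d'_{n-1}$, then we can assume without loss of generality that $d_{n-1} > d'_{n-1}$. Then

$$m - d'_{n-1}(n-1)! = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-2}(n-2)! + (d_{n-1} - d'_{n-1})(n-1)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-2}(n-2)!.$$

This is impossible: the middle expression is at least $1 + (n-1)!$, whereas by
relation (3) the expression on the right is at most $(n-1)!$. [...] representation (1 [...]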
[...] proposition, reduces to the following relation (with $d_i = i$, for all i),

$$n! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (n-1)(n-1)!,$$

which has been mentioned already.

Corollary: For any integer $1 \le m \le n!$, the assignment $m \mapsto (d_1, d_2, \ldots, d_{n-1})$,
given by equation (1), is a bijection of the set of natural numbers $\le n!$ with the set
of all $(n-1)$-tuples of integers $\{(d_1, d_2, \ldots, d_{n-1}) \mid 0 \le d_i \le i,\ i = 1, 2, \ldots, n-1\}$.
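
The division procedure used in the existence part of the proof translates directly into code. The following Python sketch is an illustration only; the function name `factorial_digits` and the choice to return the coefficients as a list `[d_0, d_1, ..., d_{n-1}]` are ours.

```python
from math import factorial


def factorial_digits(m, n):
    """Coefficients [d_0, ..., d_{n-1}] of representation (1) for 1 <= m <= n!."""
    if not 1 <= m <= factorial(n):
        raise ValueError("m must satisfy 1 <= m <= n!")
    digits = []
    r = m
    for i in range(n - 1, 0, -1):
        q, rem = divmod(r, factorial(i))
        if rem == 0:
            # Stipulation of the proof: when the division is exact, reduce
            # the quotient by 1 and take i! itself as the remainder.
            q, rem = q - 1, factorial(i)
        digits.append(q)
        r = rem
    digits.append(1)  # d_0 = 1; the last remainder is always 1
    return digits[::-1]
```

For instance, `factorial_digits(15, 4)` returns `[1, 0, 1, 2]`, that is 15 = 1·0! + 0·1! + 1·2! + 2·3!, and `factorial_digits(24, 4)` returns `[1, 1, 2, 3]`, the case $d_i = i$ of relation (3).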


2.3. Uddista and nasta processes and the factorial representation 


We shall use the factorial representation discussed above to understand the
processes of uddista and nasta as also the method of construction of the prastara.
We shall first show that there is a one to one correspondence between the set
$(a_{i_1}, a_{i_2}, \ldots, a_{i_n})$ of all permutations of n distinct symbols and the coefficients
$\{(d_0 = 1, d_1, d_2, \ldots, d_{n-1}) \mid 0 \le d_i \le i,\ i = 1, 2, \ldots, n-1\}$ which arise in the
factorial representation of numbers $1 \le m \le n!$. As an illustration of this
correspondence, we present in Table 2 the prastara $[a_1, a_2, a_3, a_4]$ of four distinct
elements along with the coefficients $d_0 = 1, d_1, d_2, d_3$ of the factorial
representation for each row-number m for $1 \le m \le 24$.

Given any permutation $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})$, the corresponding set of
coefficients $d_1, d_2, \ldots, d_{n-1}$ are determined sequentially, starting first with $d_{n-1}$,
by carefully following the inductive process that we described for generating
the prastara. As we mentioned earlier, the prastara for n elements
$[a_1, a_2, \ldots, a_{n-1}, a_n]$ can be written down using the prastara for n − 1 elements in
the following manner:


$$\begin{matrix} [a_1, a_2, \ldots, a_{n-2}, a_{n-1}]\,[a_n]_{(n-1)!} \\ [a_1, a_2, \ldots, a_{n-2}, a_n]\,[a_{n-1}]_{(n-1)!} \\ \vdots \\ [a_2, a_3, \ldots, a_{n-1}, a_n]\,[a_1]_{(n-1)!} \end{matrix}$$


Each of the blocks in the above prastara has $(n-1)!$ rows. Thus, the last element
in the first $(n-1)!$ rows is $a_n$ and this corresponds to $d_{n-1} = 0$ in the factorial
representation of the corresponding row-number. In the next $(n-1)!$ rows, the
last element is $a_{n-1}$ and this corresponds to $d_{n-1} = 1$. And so on. Thus we
see that if the last element of the given row is $a_{\sigma(n)}$, where $\sigma$ is a permutation of
$1, 2, \ldots, n$, then the corresponding row-number will have a factorial representation
with $d_{n-1} = n - \sigma(n)$. Then, we remove the element $a_{\sigma(n)}$ and consider the
prastara of the remaining n − 1 elements and determine the coefficient $d_{n-2}$ in
the same way as we found $d_{n-1}$, by considering the last element $a_{\sigma(n-1)}$ of the
permutation $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n-1)})$. For this purpose, let $\sigma(n-1)$ be the j-th
element when we arrange $\sigma(1), \sigma(2), \ldots, \sigma(n-1)$ in the increasing order. Then,
$d_{n-2} = n - 1 - j$. Again, let $\sigma(n-2)$ be the k-th element when we arrange
$\sigma(1), \sigma(2), \ldots, \sigma(n-2)$ in the increasing order. Then, $d_{n-3} = n - 2 - k$. We
continue this procedure obtaining $d_i$ up to i = 1, and then set $d_0 = 1$.


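  m  | Row of the prastara | d0 | d1 | d2 | d3
   1 | a1 a2 a3 a4         | 1  | 0  | 0  | 0
   2 | a2 a1 a3 a4         | 1  | 1  | 0  | 0
   3 | a1 a3 a2 a4         | 1  | 0  | 1  | 0
   4 | a3 a1 a2 a4         | 1  | 1  | 1  | 0
   5 | a2 a3 a1 a4         | 1  | 0  | 2  | 0
   6 | a3 a2 a1 a4         | 1  | 1  | 2  | 0
   7 | a1 a2 a4 a3         | 1  | 0  | 0  | 1
   8 | a2 a1 a4 a3         | 1  | 1  | 0  | 1
   9 | a1 a4 a2 a3         | 1  | 0  | 1  | 1
  10 | a4 a1 a2 a3         | 1  | 1  | 1  | 1
  11 | a2 a4 a1 a3         | 1  | 0  | 2  | 1
  12 | a4 a2 a1 a3         | 1  | 1  | 2  | 1
  13 | a1 a3 a4 a2         | 1  | 0  | 0  | 2
  14 | a3 a1 a4 a2         | 1  | 1  | 0  | 2
  15 | a1 a4 a3 a2         | 1  | 0  | 1  | 2
  16 | a4 a1 a3 a2         | 1  | 1  | 1  | 2
  17 | a3 a4 a1 a2         | 1  | 0  | 2  | 2
  18 | a4 a3 a1 a2         | 1  | 1  | 2  | 2
  19 | a2 a3 a4 a1         | 1  | 0  | 0  | 3
  20 | a3 a2 a4 a1         | 1  | 1  | 0  | 3
  21 | a2 a4 a3 a1         | 1  | 0  | 1  | 3
  22 | a4 a2 a3 a1         | 1  | 1  | 1  | 3
  23 | a3 a4 a2 a1         | 1  | 0  | 2  | 3
  24 | a4 a3 a2 a1         | 1  | 1  | 2  | 3

TABLE 2. Prastara of four elements with the coefficients of the factorial representation
of the row-number m.
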
As an example of the above procedure, we may, for instance, compute the row
number of the permutation $(a_3, a_2, a_4, a_1)$ in the prastara of four elements. Here
$\sigma(1) = 3$, $\sigma(2) = 2$, $\sigma(3) = 4$ and $\sigma(4) = 1$. Following the procedure
outlined above, $d_3 = 4 - \sigma(4) = 4 - 1 = 3$. Next, $a_{\sigma(3)} = a_4$ is
the third element of $a_2, a_3, a_4$ and hence $d_2 = 3 - 3 = 0$. Finally, $a_{\sigma(2)} = a_2$
occurs as the first element of $a_2, a_3$ so that $d_1 = 2 - 1 = 1$. Thus the row number
of $(a_3, a_2, a_4, a_1)$ is $1 \cdot 0! + 1 \cdot 1! + 0 \cdot 2! + 3 \cdot 3! = 20$.
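
The uddista computation just illustrated can be written out as a short Python sketch. A permutation is given by its indices $(\sigma(1), \ldots, \sigma(n))$; the function name `uddista` and this input convention are ours, chosen for illustration.

```python
from math import factorial


def uddista(perm):
    """Row number of the permutation (sigma(1), ..., sigma(n)) in the prastara."""
    remaining = sorted(perm)
    m = 1  # contribution of d_0 * 0!
    # Work from the last element of the row towards the second one.
    for i in range(len(perm) - 1, 0, -1):
        # j = position (counted from 1) of the current element among the
        # elements not yet removed, arranged in increasing order.
        j = remaining.index(perm[i]) + 1
        d_i = (i + 1) - j          # d_{n-1} = n - j, d_{n-2} = n - 1 - j, ...
        m += d_i * factorial(i)
        remaining.remove(perm[i])
    return m
```

With this sketch `uddista((3, 2, 4, 1))` evaluates to 20, the value obtained above.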

We now indicate the converse process of nasta (or finding the "lost" row),
which gives a procedure to find, for any integer $m \le n!$, the m-th row of the
prastara $[a_1, a_2, \ldots, a_n]$. We begin with the factorial representation of m:

$$m = d_0\, 0! + d_1\, 1! + d_2\, 2! + \cdots + d_{n-1}(n-1)!.$$

We set $\sigma(n) = n - d_{n-1}$ and take $a_{\sigma(n)}$ as the last element of the row. We
remove $a_{\sigma(n)}$ from the set $a_1, a_2, \ldots, a_n$ and arrange the rest of the elements in the
ascending order. If $j = n - 1 - d_{n-2}$, then we let the j-th element of this
sequence be the penultimate element $a_{\sigma(n-1)}$ of the m-th row. We next omit $a_{\sigma(n)}$
and $a_{\sigma(n-1)}$ and arrange the rest of the elements of the set $a_1, a_2, \ldots, a_n$ in the
ascending order. If $k = n - 2 - d_{n-3}$, then the k-th element in this sequence
will be the next element $a_{\sigma(n-2)}$ in the m-th row. We continue this procedure till
we use up all $d_i$, $1 \le i \le n-1$, and define $a_{\sigma(1)}$ to be the remaining symbol
left in $a_1, a_2, \ldots, a_n$. Thus the permutation in the m-th row is determined to
be $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})$. This is the nasta process. Sarngadeva also follows
essentially the same method as we shall see later.

To illustrate the above procedure, we shall find the 15th row of the prastara
$[a_1, a_2, a_3, a_4]$. We have $15 = 1 \cdot 0! + 0 \cdot 1! + 1 \cdot 2! + 2 \cdot 3!$. The rule above shows that
$\sigma(4) = 4 - 2 = 2$. The last element in the row is thus $a_2$. The remaining elements
in the increasing order of the indices are $a_1, a_3, a_4$. Since $j = 3 - 1 = 2$, we
take the second element amongst them, namely $a_3$, as the penultimate element of
the row. Now we omit $a_3$ also and are left with $a_1, a_4$. Since $k = 2 - 0 = 2$,
the next element of the row will be $a_4$. The remaining symbol is $a_1$. Thus the 15th
row of the prastara is determined to be $(a_1, a_4, a_3, a_2)$.
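
The nasta process can likewise be sketched in Python. The coefficients are extracted on the fly with `divmod(r - 1, i!)`, which is one way of implementing the convention that an exact division gives a quotient reduced by 1 and a remainder equal to i!; the function name and the representation of a row by its indices are assumptions made for this illustration.

```python
from math import factorial


def nasta(m, n):
    """The m-th row of the prastara of a_1, ..., a_n, as a tuple of indices."""
    if not 1 <= m <= factorial(n):
        raise ValueError("m must satisfy 1 <= m <= n!")
    remaining = list(range(1, n + 1))   # indices still available, ascending
    row_from_right = []
    r = m
    for i in range(n - 1, 0, -1):
        # d_i with a remainder kept in the range 1 .. i!
        d_i, r = divmod(r - 1, factorial(i))
        r += 1
        # With i + 1 elements left, take the (i + 1 - d_i)-th in ascending order.
        row_from_right.append(remaining.pop(i - d_i))
    row_from_right.append(remaining[0])  # the one symbol left over
    return tuple(reversed(row_from_right))
```

Here `nasta(15, 4)` returns `(1, 4, 3, 2)`, the row found in the illustration above.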

The factorial representation also sheds considerable light on the method of
constructing the prastara. To see this, we take one more look at the way the prastara
of n elements is constructed from that of n − 1 elements.

$$\begin{matrix} [a_1, a_2, \ldots, a_{n-2}, a_{n-1}]\,[a_n]_{(n-1)!} \\ [a_1, a_2, \ldots, a_{n-2}, a_n]\,[a_{n-1}]_{(n-1)!} \\ \vdots \\ [a_2, a_3, \ldots, a_{n-1}, a_n]\,[a_1]_{(n-1)!} \end{matrix}$$

Here, we note that the n-th element changes for the first time (from $a_n$ to $a_{n-1}$)
after $(n-1)!$ rows. And, at the same time, the coefficient $d_{n-1}$ in the factorial
representation changes from 0 to 1. In other words, for numbers $1 \le m \le (n-1)!$,
$d_{n-1} = 0$; and we have $d_{n-1} = 1$ for the first time when $m = (n-1)! + 1$.
Now, the number $(n-1)!$ has the factorial representation

$$(n-1)! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (n-2)(n-2)! + 0 \cdot (n-1)!.$$

Here, we have (apart from the usual $d_0 = 1$), $d_i = i$ for $1 \le i \le n-2$, and
$d_{n-1} = 0$. When we go to the next row, the representation is changed to

$$(n-1)! + 1 = 1 \cdot 0! + 0 \cdot 1! + 0 \cdot 2! + \cdots + 0 \cdot (n-2)! + 1 \cdot (n-1)!.$$

That is, in passing from $(n-1)!$ to $(n-1)! + 1$, the coefficient $d_{n-1}$ increases by
1, and the coefficients $d_i$ change to 0 for $1 \le i \le n-2$. By using induction on n,
we can easily establish the following more general result:


Proposition 2: Let m be an integer such that $1 \le m < n!$, with factorial representation

$$m = 1\cdot 0! + d_1\,1! + d_2\,2! + \cdots + d_{n-1}\,(n-1)!,$$

with $0 \le d_i \le i$, for $i = 1, 2, \ldots, n-1$. If $d_k$ ($k \ge 1$) is the first coefficient in the above factorial representation such that $d_k < k$, then the factorial representation for $(m + 1)$ is given by

$$m + 1 = 1\cdot 0! + d'_1\,1! + d'_2\,2! + \cdots + d'_{n-1}\,(n-1)!,$$

where $d'_i = 0$ for $1 \le i < k$, $d'_k = d_k + 1$, and $d'_i = d_i$ for $i > k$.
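The successor rule of Proposition 2 can be tried out directly. The following Python sketch is an illustration under our own naming (`next_factorial_digits`, `value`); the coefficients are held in a list `[d_1, ..., d_{n-1}]`, with the fixed term $1\cdot 0!$ left implicit.

```python
from math import factorial

def value(digits):
    """Number represented by [d_1, ..., d_{n-1}], including the fixed term 1*0!."""
    return 1 + sum(d * factorial(i) for i, d in enumerate(digits, start=1))

def next_factorial_digits(digits):
    """Apply the rule of Proposition 2: reset the coefficients below the first
    d_k with d_k < k to zero and increase d_k by one."""
    for k, d in enumerate(digits, start=1):
        if d < k:
            return [0] * (k - 1) + [d + 1] + digits[k:]
    raise ValueError("all coefficients are maximal: the number is already n!")
```

Starting from `[0, 0, 0]` (the number 1) and applying the rule repeatedly runs through 1, 2, ..., 24 in order, ending at `[1, 2, 3]`.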

recise relation between the factorial representation 

erating successive rows in the prastara, dis- 


Ssec by Sarngadeva 


sed by Sarngadeva 


entation of integers 


- 1), then from (4) we have m =

discussed above. Sarngadeva bases his discussion of the uddista and nasta processes on a tabular figure referred to as the khanda-meru, which is essentially a device to read off the coefficients $d_i$ which arise in the factorial representation of any given number and vice versa. The khanda-meru is described in the following verses:


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in this copy.]


The tabular figure khanda-meru consists of seven columns (as we are dealing with the seven svaras in tana-prastara) with the number of entries increasing from one to seven, from left to right. In the first row, place 1 in the first column (from the left) followed by 0's in the other columns. In the second row, starting from the second column, place the total number (or samkhya) of tanas of 1, 2, 3, etc., svaras. Thus the entries in the second row are the factorials, 1, 2, 6, 24, 120 and 720. In the succeeding rows, place twice, thrice etc. (that is, successive multiples) of the factorials, starting from a later column at each stage. Thus, the first column of the meru consists of just 0! = 1, and the n-th column, for n > 1, is made up of the multiples $i\cdot(n-1)!$, with $0 \le i \le n-1$, as shown in Table 3.


    1    0    0    0    0     0     0
         1    2    6   24   120   720
              4   12   48   240  1440
                  18   72   360  2160
                       96   480  2880
                            600  3600
                                 4320

TABLE 3. Khanda-meru.


Thus, in the tabular figure of khanda-meru, Sarngadeva has a table of multiples of all the factorials. He uses this effectively to find out the number of the row in a prastara that corresponds to a given tana (the uddista process), and conversely (the nasta process), by precisely making use of the factorial representation that we discussed earlier. This will be clear from what follows.

2.4.2. Uddista Process


Sarngadeva explains the uddista process as follows: 


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in this copy.]


Given a tana (of n svaras), note the rank of the last svara, counting the svaras from the end in the natural order (i.e., counting the svaras in the reverse of the natural order). Mark the corresponding entry in the last or the n-th column of the khanda-meru. Note the rank of the next svara (in the reverse of natural order) among the remaining svaras. Mark the corresponding entry in the next or the (n − 1)-th column. And so on. The uddista or the rank-number of the given tana is the sum of all the marked entries.


Example: To find the rank-number of the tana MSRG in the prastara of SRGM.

• The last svara G is the second among S, R, G, M in the reverse order, so mark the second entry, 6, in the fourth column.

• The next svara R is the second among S, R, M in the reverse order, so mark the second entry, 2, in the third column.

• The next svara S is the second among S, M in the reverse order, so mark the second entry, 1, in the second column.

• The entry 1 of the first column is always included.

Clearly, the process of uddista, as set forth above by Sarngadeva, involves first working out the coefficients $d_i$ which correspond to the given tana or permutation. Then the khanda-meru is employed to find out the row-number associated with these coefficients. In the case of the given tana, MSRG, the coefficients turn out to be $d_3 = d_2 = d_1 = 1$. Then the corresponding row-number is found from the khanda-meru to be

$$1\cdot 0! + 1\cdot 1! + 1\cdot 2! + 1\cdot 3! = 1 + 1 + 2 + 6 = 10.$$


2.4.3. Nasta process 


Sarngadeva explains the converse or the nasta process as follows: 


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in this copy.]


To find the tana (of n svaras) corresponding to a given rank-number or nasta-samkhya, mark that entry which is the greatest number strictly less than the rank-number in the n-th column of the khanda-meru. Subtract that entry from the rank-number and mark the entry, which is the greatest number strictly less than the resulting number, in the next or the (n − 1)-th column. And so on. The position of the marked entry in the last column gives the rank of the last svara of the tana in the reverse natural order. The position of the marked entry in the next column gives the rank of the last but one svara, amongst the remaining svaras, in the reverse natural order. And so on.

Example: To find the 18th tana in the prastara of SRGM.

• In the fourth column of the khanda-meru, the greatest number less than 18 is 12, which occurs in the third row. The fourth svara of the tana is thus the third among S, R, G, M in reverse order, namely R.

• Now, 18 − 12 = 6. The greatest number less than this in the third column is 4, which occurs in the third row. The third svara of the tana is the third among S, G, M in reverse order, S.

• Now, 6 − 4 = 2. The greatest number less than this in the second column is 1, occurring in the second row. The second svara of the tana is the second among G, M in reverse order, G.

• The first svara of the tana is the one left, M.


Raja Sridharan, R. Sridharan, M. D. Srinivas

    S    R    G    M    P     D     N
    1    0    0    0    0     0     0
         1    2    6   24   120   720
              4   12   48   240  1440
                  18   72   360  2160
                       96   480  2880
                            600  3600
                                 4320

TABLE 5. Khanda-meru for nasta.


The 18th tana is therefore MGSR.

Clearly, the process of nasta, as set forth above by Sarngadeva, involves first working out the factorial representation of the given number. In the above example, the number 18 was decomposed as

$$18 = 1 + 1 + 4 + 12 = 1\cdot 0! + 1\cdot 1! + 2\cdot 2! + 2\cdot 3!$$

Here, the khanda-meru is used to straight away read off the coefficients $d_1, d_2, d_3, d_4$, which occur in the factorial representation of the given number. Once these coefficients are determined, the tana or the permutation can be obtained by the standard procedure discussed earlier.
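Both readings of the khanda-meru can be imitated in Python. The sketch below is only an illustration of the paraphrased rules; the function names are ours, and the svaras are assumed to be passed in their natural order (for instance the string `"SRGM"`).

```python
from math import factorial

def khanda_meru(n):
    """Columns 1..n of the meru: column 1 holds 0! = 1, and column c > 1
    holds the multiples i*(c-1)! for i = 0, ..., c-1."""
    return [[1]] + [[i * factorial(c - 1) for i in range(c)]
                    for c in range(2, n + 1)]

def uddista(tana, svaras):
    """Rank-number of `tana`: the sum of the marked entries of the meru."""
    meru = khanda_meru(len(svaras))
    remaining = list(svaras)
    total = 1                                   # the entry of the first column
    for c in range(len(svaras), 1, -1):
        svara = tana[c - 1]
        rank = remaining[::-1].index(svara)     # 0-based, reverse natural order
        total += meru[c - 1][rank]
        remaining.remove(svara)
    return total

def nasta(rank_number, svaras):
    """Tana with the given rank-number (1 <= rank_number <= n!), as a list."""
    meru = khanda_meru(len(svaras))
    remaining = list(svaras)
    tana = []                                   # built from the last svara backwards
    for c in range(len(svaras), 1, -1):
        column = meru[c - 1]
        # row of the greatest entry strictly less than the current number
        row = max(i for i, entry in enumerate(column) if entry < rank_number)
        rank_number -= column[row]
        tana.append(remaining[::-1][row])
        remaining.remove(tana[-1])
    tana.append(remaining[0])
    return tana[::-1]
```

With these definitions `uddista("MSRG", "SRGM")` gives 10 and `nasta(18, "SRGM")` gives the tana M, G, S, R, in agreement with the two worked examples.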


The famous mathematical text Ganitakaumudi [...] Narayana Pandita has a chapter on Ankapasa. Here, Narayana [...] the prastara of the set of all permutations of [...]. He also uses the same khanda-meru in [...] uddista processes. As we have also remarked, the [...] on a factorial representation of integers. [...] explicitly, the rule for prastara given in [...] of svaras. Narayana [...] nasta and uddista processes [...]

3. Tala-prastara


Apart from Sangita-samayasara of Parsvadeva (c. 1165), which has a brief discussion of tala-prastara, Sangitaratnakara is the oldest text which deals with tala-prastara. Its treatment of the subject is much more detailed and definitive than that found even in several later texts such as Sangitopanisad-saroddhara of Sudhakarakalasa (c. 1325), Nartana-nirnaya of Pundarikavitthala (c. 1560) and Sangita-darpana of Catura Damodara (c. 1600).

Chapter V of Sangitaratnakara is the Taladhyaya with 409 verses. Starting with the definition of tala, the first 235 verses deal with marga-talas (classical rhythm patterns). Verses 236-311 discuss 120 desi-talas (regional rhythm patterns). At the end of this discussion, it is noted that there are indeed very many such talas and it would not be possible to display all of them. This sets the stage for the prastara-prakarana which takes up the remaining nearly 100 verses of the Taladhyaya.


3.1. Talangas 


The talangas, or the basic rhythm components, considered by Sarngadeva are four, namely, druta, laghu, guru and pluta, which may be taken to be of duration 1, 2, 4 and 6 respectively, in druta units. Thus the tala-prastara considered in Sangitaratnakara is sometimes referred to as caturanga-tala-prastara.⁶

This process of tala-prastara is a generalization of the process of matra-vrtta-prastara discussed in the texts of Chandas-sastra. There, only laghu and guru are considered and they are of one and two time units respectively. Ganita-kaumudi of Narayana Pandita (c. 1356) discusses a different kind of generalization of the matra-vrtta-prastara of Chandas-sastra, by including units which are also of 3, 4, ..., q matras in the prastara. The tala-prastara of Sangitaratnakara cannot however be subsumed under Narayana's generalization.


3.2. Pratyayas 


Pratyayas are simple techniques to comprehend the diverse features of the large number of talas which are noticed in practice, without having to enumerate them explicitly each time. Sarngadeva first lists the nineteen pratyayas considered in this sastra.

⁶ In a series of books, the well known musician and musicologist Vidwan Akella Mallikarjuna Sarma, formerly Principal, Government College of Music, Hyderabad, has given a detailed exposition of the theory of tala-prastara as expounded in Sangitaratnakara. According to Vidwan Sarma, there are some later texts of Sangita-sastra (as yet unpublished), which include some more talangas, such as anu-druta etc., and different kinds of laghus (khanda-jati, etc.) in their discussion of tala-prastaras. It is important to bring these texts to light in order to understand further developments of the theory of tala-prastara due to the successors of Sarngadeva.


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in this copy.]

The pratyayas which are being discussed by Sarngadeva are the following:

• Prastara: Systematic generation and enumeration of all possible tala-forms with a given total number of time-units.

• Samkhya: Determination of the number of tala-forms in a prastara.

• Nasta: Finding the tala-form corresponding to a given rank.

• Uddista: Ranking of a given tala-form in a prastara.

• Patala: Total number of drutas, laghus, gurus and plutas in a prastara.

• Druta-meru, Laghu-meru, Guru-meru and Pluta-meru: Finding the number of tala-forms with a given number of drutas, laghus, gurus and plutas, [...]

Sarngadeva's procedure for the construction of prastara is as follows:


• The last row of the prastara has all drutas only.

• In the first row, place as many plutas as possible to the right, followed (from right to left), if possible, by a guru and a druta or a guru alone; or by a laghu and a druta or a laghu alone; or by a druta alone to the left.

• To go from any row of the prastara to the next, identify the first non-druta element from the left. Place below that the element next to it with lower duration: place druta below a laghu, laghu below a guru and guru below a pluta.

• Bring down the elements to the right as they are.

• Make up for the deficient units (if any) by adding to the left as many plutas as possible, followed similarly by gurus, laghus and drutas in that order from right to left.
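The rules above translate almost word for word into a short program. The following Python sketch is our own illustrative reading of them, with D, L, G, P standing for druta, laghu, guru and pluta; the names `fill`, `next_row` and `tala_prastara` are hypothetical.

```python
DURATION = {"D": 1, "L": 2, "G": 4, "P": 6}      # druta, laghu, guru, pluta
NEXT_LOWER = {"L": "D", "G": "L", "P": "G"}      # element of next lower duration

def fill(units):
    """Make up `units` with as many plutas as possible at the right, followed
    towards the left by gurus, laghus and drutas."""
    filler = []
    for symbol in "PGLD":
        while units >= DURATION[symbol]:
            filler.append(symbol)
            units -= DURATION[symbol]
    return filler[::-1]                          # smaller elements end up on the left

def next_row(row, total):
    """Row following `row` in the prastara, or None after the all-druta row."""
    for i, symbol in enumerate(row):
        if symbol != "D":                        # first non-druta from the left
            lowered = NEXT_LOWER[symbol]
            right = row[i + 1:]                  # brought down unchanged
            deficit = total - DURATION[lowered] - sum(DURATION[s] for s in right)
            return fill(deficit) + [lowered] + right
    return None

def tala_prastara(total):
    """All rows of the prastara of `total` druta units, from first to last."""
    rows, row = [], fill(total)
    while row is not None:
        rows.append("".join(row))
        row = next_row(row, total)
    return rows
```

For instance, `tala_prastara(4)` yields the six rows G, LL, DDL, DLD, LDD, DDDD, and `tala_prastara(7)` yields 33 rows beginning with DP and ending with seven drutas.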


Example: Sapta-druta-prastara 

We shall use the symbols D, L, G, P to stand for druta, laghu, guru and pluta 
respectively. The traditional symbols for these are somewhat like o, |, S, $. In 
Table 6, we display the sapta-druta-prastara or the enumeration of all rhythmic 
patterns of seven time units following the rule of enumeration of Sarngadeva. We 
also show the time duration of each talanga on the right.


3.4. Samkhya 


Sarngadeva next discusses the pratyaya of samkhya, which gives the number of 
rows in the prastara associated with rhythms of a given time measure. 


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in this copy.]

TABLE 6. Seven-druta-prastara. [Table body illegible in this copy; its columns list the tala forms and their durations in terms of kala-pramanas.]


The sequence of samkhyankas $S_n$, where $S_n$ is the number of tala forms in the n-druta-prastara, is obtained as follows: The first two elements of the sequence are $S_1 = 1$, $S_2 = 2$. The n-th element of the sequence ($S_n$) is obtained by adding the previous ($S_{n-1}$), second-previous ($S_{n-2}$), fourth-previous ($S_{n-4}$) and the sixth-previous ($S_{n-6}$) elements. That is, the samkhyankas satisfy the recurrence relation:

$$S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6} \qquad (6)$$

In case the fourth or sixth-previous elements do not exist, the third and the fifth-previous (if they exist) are to be taken as their representatives (pratinidhi). As we shall see, this happens only in two cases, namely $S_4$, $S_6$.

Sarngadeva also makes the very interesting observation that among all the tala-forms which appear in the n-druta-prastara, $S_{n-1}$ end in a druta, $S_{n-2}$ in a laghu, $S_{n-4}$ in a guru and $S_{n-6}$ end in a pluta, and the total number of tala-forms $S_n$ in the n-druta-prastara is just the sum of these four numbers. We shall subsequently discuss the significance of this observation.


    n     1   2   3   4    5    6    7    8     9    10
    S_n   1   2   3   6   10   19   33   60   106   191

TABLE 7. Sequence of samkhyankas.
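The entries of Table 7 can be recomputed in two ways, which the following Python sketch compares. It is an illustration only: the first function uses the convention $S_0 = 1$ (with $S_k = 0$ for negative k), the second follows the prescription of representatives (pratinidhi) literally; both function names are ours.

```python
def samkhyankas(n):
    """[S_1, ..., S_n] from the recurrence, assuming S_0 = 1 and S_k = 0 for k < 0."""
    s = [1]                                        # s[0] plays the role of S_0
    for k in range(1, n + 1):
        s.append(sum(s[k - d] for d in (1, 2, 4, 6) if k - d >= 0))
    return s[1:]

def samkhyankas_pratinidhi(n):
    """[S_1, ..., S_n] starting from S_1 = 1, S_2 = 2 and using the third- or
    fifth-previous term as representative when the fourth- or sixth-previous
    term does not exist."""
    s = {1: 1, 2: 2}
    for k in range(3, n + 1):
        total = s[k - 1] + s[k - 2]
        for back in (4, 6):
            if k - back >= 1:
                total += s[k - back]
            elif k - back + 1 >= 1:                # the representative, if it exists
                total += s[k - back + 1]
        s[k] = total
    return [s[k] for k in range(1, n + 1)]
```

Both functions return 1, 2, 3, 6, 10, 19, 33, 60, 106, 191 for n = 10.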


In Table 7, we give the sequence of samkhyankas $S_n$. Here, $S_1 = 1$, $S_2 = 2$, $S_3 = 2 + 1 = 3$, $S_4 = 3 + 2 + 1 = 6$, $S_5 = 6 + 3 + 1 = 10$, $S_6 = 10 + 6 + 2 + 1 = 19$, $S_7 = 19 + 10 + 3 + 1 = 33$, $S_8 = 33 + 19 + 6 + 2 = 60$, and so on. The number 1 which appears in the sum for obtaining $S_4$ and $S_6$ is the third-previous and the fifth-previous samkhyanka respectively, and is used here as the representative (pratinidhi) of the fourth-previous and the sixth-previous samkhyankas, as they are absent. It is sometimes suggested that we can get rid of this prescription to use the representatives by introducing an entry $S_0 = 1$ before the sequence of samkhyankas.⁷ The recursion relation $S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$ for the samkhyankas is a generalization of the recursion relation

$$s_n = s_{n-1} + s_{n-2}$$

that defines the samkhyankas $s_n$ in a matra-vrtta-prastara. This relation was noted by Virahanka (c. 650) and gives the so called Fibonacci sequence or, more appropriately, the Virahanka-Fibonacci sequence of numbers.⁸ Indeed the mathematical theory of tala-prastara turns out to be a generalization of the mathematical theory of the matra-vrtta-prastara, as we shall see below.

⁷ See for instance, the works of Akella Mallikarjuna Sarma cited in the bibliography.

⁸ In Ganitakaumudi (c. 1356), Narayana Pandita considers a different kind of generalization of the Virahanka-Fibonacci sequence by considering the relation $S_n = S_{n-1} + S_{n-2} + \cdots + S_{n-q}$ to generate what he calls the samasika-pankti of order q.

TABLE 8. Four-druta-prastara.


3.5. An inductive construction of the Tala-prastara and the Samkhya rule


The construction process of tala-prastara is indeed a generalization of the construction of matra-vrtta-prastara as described in texts of Sanskrit prosody. Whilst laghu (l) and guru (g) are units of value 1 and 2 respectively in prosody, in the theory of musical rhythms there are four units druta (D), laghu (L), guru (G) and pluta (P) which have values 1, 2, 4 and 6 respectively.

In prosody, a row of a prastara of a given value n consists of a sequence of laghus (l) and gurus (g) such that the total value of the row is n, where the total value is by definition the sum of all the values of the symbols in the row. The prastara is actually an enumeration of all sequences of l and g by a certain rule, such that each row has value n, and all possible rows of value n are included under the rule of enumeration. The construction of the prastara is achieved by an inductive process.

A similar process is at work in the case of tala-prastara. The prastara of value 1 is just D. So the corresponding samkhya (the number of tala-patterns corresponding to a given value) is $S_1 = 1$. The prastara of value 2 consists of two rows, L and DD, and correspondingly $S_2 = 2$. The prastara of value 3 consists of three rows, DL, LD and DDD, and correspondingly $S_3 = 3$. The prastara of value four has six rows as shown in Table 8, and correspondingly $S_4 = 6$.

The prastara of value five has ten rows as shown in Table 9, and correspondingly $S_5 = 10$. The prastara of value six has nineteen rows as shown in Table 10. Thus $S_6 = 19$. In each of the above cases, we have an enumeration of all rows of value 1, 2, 3, 4, 5 and 6.

It is now clear that the tala-prastara of value n can be constructed in the following manner. We first take the prastara consisting of rows of value n − 6 and augment it by adding a pluta (P) at the end. Below that, place the prastara of value n − 4 and augment it by placing a guru (G) at the end. Below that, place the prastara of value n − 2 and augment it by placing a laghu (L) at the end. Below that, finally, place the prastara of value n − 1 and augment it by placing a druta (D) at the end. We can see the above process clearly at work by noting that the prastara of value 7, given in Table 6, is constructed in the manner described above from prastaras of values 1, 3, 5 and 6.

    ...
    15   L L D D
    16   D D L D D
    17   D L D D D
    18   L D D D D
    19   D D D D D D

TABLE 10. Six-druta-prastara.
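The inductive construction lends itself to a short recursive sketch in Python. The function name `prastara` and the convention that the prastara of value 0 consists of a single empty row are our assumptions, made so that the recursion covers small values of n without special cases.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def prastara(n):
    """Rows of the tala-prastara of value n, from the first row to the last.
    Assumption: value 0 has one (empty) row and negative values have none."""
    if n < 0:
        return ()
    if n == 0:
        return ("",)
    rows = []
    # blocks ending in pluta, guru, laghu and druta, placed one below the other
    for symbol, duration in (("P", 6), ("G", 4), ("L", 2), ("D", 1)):
        rows.extend(row + symbol for row in prastara(n - duration))
    return tuple(rows)
```

Here `prastara(4)` gives G, LL, DDL, DLD, LDD, DDDD, and the last five rows of `prastara(6)` are LLDD, DDLDD, DLDDD, LDDDD, DDDDDD. The number of rows in each block is $S_{n-6}$, $S_{n-4}$, $S_{n-2}$ and $S_{n-1}$ respectively, which is the samkhya rule.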


3.6. The generating function for Samkhyankas $S_n$


We first explain how, in the case of matra-vrtta-prastara in prosody, the sequence of samkhyas (the number of moric metres of a given value) may be obtained. For example, let us consider the row lgg which occurs in the prastara of rows of value 5. We associate with it a term in the expansion of $(x + x^2)^3$, namely the term $x^5$ which is obtained by taking $x$ from the first factor and $x^2$ from the second and third factors and multiplying the monomials so chosen. To consider another example, to the row glll we associate a term in the expansion of $(x + x^2)^4$, namely the term $x^5$ which is obtained similarly by taking $x^2$ from the first factor and $x$ from the second, third and fourth factors. From such examples it is clear that if $s_n$ is the number of rows in the matra-vrtta-prastara of value n, then the generating function of $s_n$ is given by

$$\sum_{n \ge 0} s_n x^n = 1 + (x + x^2) + (x + x^2)^2 + \cdots + (x + x^2)^n + \cdots = [1 - (x + x^2)]^{-1}$$

It turns out that the number $s_n$ is the so called n-th Fibonacci number, or what may more appropriately be called the n-th Virahanka-Fibonacci number.
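The expansion can be checked numerically by summing truncated powers of the polynomial. The Python sketch below is only an illustration; `series_coefficients` is our own name, and `parts` lists the exponents occurring in the polynomial, so `(1, 2)` stands for $x + x^2$ and `(1, 2, 4, 6)` for $x + x^2 + x^4 + x^6$.

```python
def multiply(a, b, max_degree):
    """Product of two polynomials (coefficient lists), truncated at max_degree."""
    out = [0] * (max_degree + 1)
    for i, x in enumerate(a):
        if x:
            for j, y in enumerate(b):
                if y and i + j <= max_degree:
                    out[i + j] += x * y
    return out

def series_coefficients(parts, max_degree):
    """Coefficients of 1 + p + p^2 + ... up to x^max_degree, where p is the
    sum of x^k over k in `parts`."""
    poly = [0] * (max_degree + 1)
    for k in parts:
        if k <= max_degree:
            poly[k] += 1
    total = [0] * (max_degree + 1)
    power = [1] + [0] * max_degree               # p^0
    for _ in range(max_degree + 1):              # higher powers start beyond max_degree
        total = [t + c for t, c in zip(total, power)]
        power = multiply(power, poly, max_degree)
    return total
```

With `parts = (1, 2)` the coefficients of $x^1, x^2, \ldots$ are 1, 2, 3, 5, 8, ..., and with `parts = (1, 2, 4, 6)` they are 1, 2, 3, 6, 10, 19, 33, ..., the numbers of rows in the corresponding prastaras.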

It follows in a similar manner that the generating function for the samkhyankas $S_n$ (the number of rows in the tala-prastara of value n) is given by

$$\sum_{n \ge 0} S_n x^n = 1 + (x + x^2 + x^4 + x^6) + (x + x^2 + x^4 + x^6)^2 + \cdots + (x + x^2 + x^4 + x^6)^n + \cdots = [1 - (x + x^2 + x^4 + x^6)]^{-1}$$

[...] that characterizes the Virahanka-Fibonacci numbers.


3.7. The decomposition of an integer in terms of Sarngadeva numbers $S_n$

In this section we shall exhibit a procedure to decompose an integer $m \ge 1$ in terms of the Sarngadeva numbers $S_n$, using induction. The motivation for this comes from the following result for the Fibonacci numbers which was demonstrated in the mathematical theory of matra-vrtta-prastara.⁹

Proposition: Any positive integer is either a Fibonacci number or can be expressed uniquely as a sum of non-consecutive Fibonacci numbers.

In what follows, we shall prove an analogue of this result for Sarngadeva numbers. If m = 1, we write $1 = S_1$. Then we write $2 = S_2$, $3 = S_3$, $4 = S_1 + S_3$. Now, for any general m, choose the largest integer q such that $S_q \le m$. By induction, following the same procedure, we can decompose $m - S_q$, that is, write it in the form

$$m - S_q = S_{i_1} + S_{i_2} + \cdots + S_{i_p}$$

Then,

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_p} + S_q, \quad \text{with } i_1 \le i_2 \le \cdots \le i_p \le q \qquad (8)$$

is a decomposition of m which we shall refer to as the canonical decomposition. Here, for any k with $1 \le k \le p$, $i_k$ is the largest integer such that $S_{i_k} \le m - (S_{i_{k+1}} + S_{i_{k+2}} + \cdots + S_{i_p} + S_q)$. We may also note that if m is itself a Sarngadeva number $S_q$, then $m = S_q$ is the canonical decomposition of m. We shall now show that the canonical decomposition has certain properties. For this we need the following lemma.
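The canonical decomposition is a greedy procedure, and a small Python sketch may make it concrete. The names `sarngadeva_numbers` and `canonical_decomposition` are ours; the numbers $S_n$ are generated with the convention $S_0 = 1$, which is an assumption of this sketch.

```python
def sarngadeva_numbers(limit):
    """[S_1, S_2, ...] up to and including the first term exceeding `limit`."""
    s = [1]                                      # S_0 = 1 by convention
    while s[-1] <= limit:
        k = len(s)
        s.append(sum(s[k - d] for d in (1, 2, 4, 6) if k - d >= 0))
    return s[1:]

def canonical_decomposition(m):
    """Indices i_1 < i_2 < ... with m = S_{i_1} + S_{i_2} + ..., obtained by
    repeatedly removing the largest S_q not exceeding what is left."""
    s = sarngadeva_numbers(m)
    indices = []
    while m > 0:
        q = max(i for i, term in enumerate(s, start=1) if term <= m)
        indices.append(q)
        m -= s[q - 1]
    return indices[::-1]
```

For example, `canonical_decomposition(4)` returns `[1, 3]`, that is $4 = S_1 + S_3$, and `canonical_decomposition(18)` returns `[2, 4, 5]`, that is $18 = 2 + 6 + 10$.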
Lemma 1: The Sarngadeva numbers $S_n$ satisfy the following relations:

(i) $2S_n \ge S_{n+1}$, and equality holds only if n = 1, 3 (9)

(ii) $S_n + S_{n+1} + S_{n+2} \ge S_{n+3}$, and equality holds only if n = 1, 3 (10)

(iii) $S_n + S_{n+1} + S_{n+3} + S_{n+4} \ge S_{n+5}$, and equality holds only if n = 1 (11)

Proof: These relations follow directly from the recurrence relation (6). We have

$$S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$$

⁹ The mathematical theory of matra-vrtta-prastara is outlined in R. Sridharan, Pratyayas for Matravrttas and Fibonacci Numbers, Mathematics Teacher, 42, 2006, 120-137.

We also note that for n > 1, $S_n > S_{n-1}$. Hence

$$
\begin{aligned}
2S_n &= 2S_{n-1} + 2S_{n-2} + 2S_{n-4} + 2S_{n-6} \\
     &\ge S_n + S_{n-1} + S_{n-3} + S_{n-5} \quad \text{(by induction)} \\
     &= S_{n+1}.
\end{aligned}
$$

This proves (i). Now we prove (ii) by induction. We note that property (ii) holds if n = 2. Now, we use the recurrence relation (6) to write

$$
\begin{aligned}
S_n + S_{n+1} + S_{n+2} &= (S_{n-1} + S_n + S_{n+1}) + (S_{n-2} + S_{n-1} + S_n) \\
 &\quad + (S_{n-4} + S_{n-3} + S_{n-2}) + (S_{n-6} + S_{n-5} + S_{n-4}) \\
 &\ge S_{n+2} + S_{n+1} + S_{n-1} + S_{n-3} \quad \text{(by induction)} \\
 &= S_{n+3}.
\end{aligned}
$$

This proves (ii). Property (iii) follows directly from the recurrence relation (6), as we have

$$S_n + S_{n+1} + S_{n+3} + S_{n+4} \ge S_{n-1} + S_{n+1} + S_{n+3} + S_{n+4} = S_{n+5}$$

This completes the proof of the lemma.
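The three relations of Lemma 1 can also be checked numerically, together with the values of n at which equality occurs. The Python sketch below is a check over a finite range only, not a proof; the function names are ours and $S_0 = 1$ is used as a convention to start the recurrence.

```python
def samkhyankas(n):
    """[S_0, S_1, ..., S_n], with the convention S_0 = 1."""
    s = [1]
    for k in range(1, n + 1):
        s.append(sum(s[k - d] for d in (1, 2, 4, 6) if k - d >= 0))
    return s

def check_lemma1(n_max):
    """Verify relations (i)-(iii) for 1 <= n <= n_max and report, for each
    relation, the values of n at which equality holds."""
    s = samkhyankas(n_max + 5)
    equal = {"i": [], "ii": [], "iii": []}
    for n in range(1, n_max + 1):
        gaps = {
            "i": 2 * s[n] - s[n + 1],
            "ii": s[n] + s[n + 1] + s[n + 2] - s[n + 3],
            "iii": s[n] + s[n + 1] + s[n + 3] + s[n + 4] - s[n + 5],
        }
        for name, gap in gaps.items():
            if gap < 0:
                raise AssertionError(f"relation ({name}) fails at n = {n}")
            if gap == 0:
                equal[name].append(n)
    return equal
```

For n up to 30 the check reports equality in (i) at n = 1, 3, in (ii) at n = 1, 3 (since $S_3 + S_4 + S_5 = 3 + 6 + 10 = 19 = S_6$), and in (iii) at n = 1.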
Now consider some number $m < S_n$, and let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r}, \quad \text{with } i_1 < i_2 < \cdots < i_r \qquad (12)$$

be the canonical decomposition of the number, where we have taken note of the fact (which follows from (9) and the construction of the canonical decomposition) that each $S_i$ appears at the most once in the canonical decomposition. We can therefore write the above decomposition in the [...] the following [...]

(b) A combination of the type (1, 1, 0, 1, 1) cannot occur as a part of the above sequence.


(c) A combination of the type (1, 0, 1, 0, 1, 1) cannot occur as a part of the above 
sequence [This follows from the recurrence relation (6)]. 


(d) The first two entries cannot both be 1s in the above sequence [This follows from the fact that the first three Sarngadeva numbers are related by $S_3 = S_1 + S_2$].

(e) The sequence cannot begin with the combination (1, 0, 1, 1, 0) [This follows from the fact that $S_5 = S_1 + S_3 + S_4$].


We now proceed to show in Proposition 3 that the above conditions are sufficient 
to characterise the binary sequences that are associated with the canonical decom- 
position of a number. We first prove a lemma that is needed for the proof of this 
proposition. 

Lemma 2: Let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r}, \quad \text{with } i_1 < i_2 < \cdots < i_r$$

be any decomposition of a number into Sarngadeva numbers and assume that the corresponding binary sequence of 0's and 1's satisfies the conditions (a)-(e). Then

$$S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{i_r + 1} \qquad (14)$$

Proof: We shall prove (14) by induction. We note that (14) is equivalent to

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-1}} < S_{i_r + 1} - S_{i_r} \qquad (15)$$

(which we know is $S_{i_r - 1} + S_{i_r - 3} + S_{i_r - 5}$).

If $i_{r-1} < i_r - 1$, then by induction,

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-1}} < S_{i_{r-1} + 1} \le S_{i_r - 1}$$

and we are through with (15) and hence with (14). Suppose $i_{r-1} = i_r - 1$. Then, we can cancel $S_{i_{r-1}} = S_{i_r - 1}$ from both sides of (15) and we are left to show that

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-2}} < S_{i_r - 3} + S_{i_r - 5} \qquad (16)$$

If $i_{r-2} < i_r - 3$, then we are through by induction. Otherwise, since $i_{r-2} < i_{r-1} < i_r$ and $i_{r-1} = i_r - 1$, $i_{r-2}$ is either $i_r - 2$ or $i_r - 3$. If $i_{r-2} = i_r - 2$, then since $i_{r-1} = i_r - 1$, the combination (1, 1, 1) appears in the sequence, which is a contradiction. Thus, the only possibility is $i_{r-2} = i_r - 3$, in which case we can cancel $S_{i_{r-2}}$ from both sides of (16) and are left to prove

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-3}} < S_{i_r - 5} \qquad (17)$$

Now, $i_{r-3} < i_{r-2} = i_r - 3$. Suppose $i_{r-3} < i_r - 5$; then we are through by induction as before. Otherwise, we have $i_{r-3} \ge i_r - 5$; then since $i_{r-3} < i_{r-2} = i_r - 3$, we have that $i_{r-3}$ is either $i_r - 4$ or $i_r - 5$. If $i_{r-3} = i_r - 4$, then the binary sequence will have the combination (1, 1, 0, 1, 1) which has been excluded by assumption. On the other hand, if $i_{r-3} = i_r - 5$, then the binary sequence will have the combination (1, 0, 1, 0, 1, 1) which has also been excluded by assumption. Hence the lemma is proved.
Proposition 3: Let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r}, \quad \text{with } i_1 < i_2 < \cdots < i_r \qquad (18)$$

be any decomposition of a number and assume that the corresponding binary sequence of 0's and 1's satisfies the conditions (a)-(e). Then the above decomposition is the canonical decomposition of m. Equivalently, let

$$
\begin{aligned}
m &= S_{i_1} + S_{i_2} + \cdots + S_{i_r}, \quad \text{with } i_1 < i_2 < \cdots < i_r \\
  &= S_{j_1} + S_{j_2} + \cdots + S_{j_s}, \quad \text{with } j_1 < j_2 < \cdots < j_s \qquad (19)
\end{aligned}
$$

be any two decompositions of the number m, such that the corresponding binary sequences satisfy the conditions (a)-(e). Then the two decompositions in (19) are the same.

Proof: If $i_r = j_s$, then we can cancel $S_{i_r} = S_{j_s}$ from both the decompositions and use induction to prove the proposition. Hence we can assume without loss of generality that $i_r < j_s$. Now, from the above lemma it follows that

$$S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{i_r + 1}$$

But, since $i_r < j_s$, it follows that $S_{i_r + 1} \le S_{j_s}$. Hence, we get $S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{j_s}$, which contradicts the assumption (19). Hence the proposition is proved.

Corollary: The above propositions establish a bijection between the set of non-negative integers $0, 1, 2, \ldots, S_n - 1$, which label the rows in an n-druta-prastara, and binary sequences of length n (ending with 0), which satisfy the conditions (a)-(e) stated above.
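For small n the bijection of the Corollary can be confirmed by brute force. The Python sketch below is an illustration under two assumptions of ours: the numbers $S_n$ are generated with $S_0 = 1$, and condition (a) is taken to exclude the combination (1, 1, 1), as is suggested by the way it is invoked in the proof of Lemma 2. All names are hypothetical.

```python
from itertools import product

def samkhyankas(n):
    """[S_0, S_1, ..., S_n], with the convention S_0 = 1."""
    s = [1]
    for k in range(1, n + 1):
        s.append(sum(s[k - d] for d in (1, 2, 4, 6) if k - d >= 0))
    return s

def canonical_bits(m, n):
    """Binary sequence (b_1, ..., b_n) of the canonical decomposition of m < S_n;
    b_i = 1 exactly when S_i occurs."""
    s = samkhyankas(n)
    bits = [0] * n
    for q in range(n, 0, -1):                    # greedy: largest S_q first
        if s[q] <= m:
            bits[q - 1] = 1
            m -= s[q]
    return tuple(bits)

FORBIDDEN_ANYWHERE = [(1, 1, 1), (1, 1, 0, 1, 1), (1, 0, 1, 0, 1, 1)]  # (a), (b), (c)
FORBIDDEN_AT_START = [(1, 1), (1, 0, 1, 1, 0)]                         # (d), (e)

def contains(bits, pattern):
    k = len(pattern)
    return any(bits[i:i + k] == pattern for i in range(len(bits) - k + 1))

def admissible(bits):
    """True if the sequence ends with 0 and satisfies conditions (a)-(e)."""
    if bits[-1] != 0:
        return False
    if any(contains(bits, p) for p in FORBIDDEN_ANYWHERE):
        return False
    return not any(bits[:len(p)] == p for p in FORBIDDEN_AT_START)
```

Comparing, for a given n, the set of `canonical_bits(m, n)` for $0 \le m < S_n$ with the set of admissible sequences of length n shows that the two coincide for the small values of n tried in the accompanying check.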


3.8. The mathematical basis of the nasta and uddista processes

We enumerate the rows of a prastara of a total value n starting from the last row, numbering the last row as 0, the penultimate row as 1, and so on. If the rank of a row in this numbering is t, then the number of the row in the usual enumeration (starting from the top and calling the top row as row number 1) is $S_n - t$. The process of nasta consists in writing down explicitly the row (without the aid of the prastara), if one is given the row number m in the enumeration we have just described. If m = 0, then the row consists of n Ds. If $m < S_{n-1}$, then the last symbol of the row is D because the last $S_{n-1}$ rows of the n-druta-prastara end with a D, by very definition. If $S_{n-1} \le m < S_{n-1} + S_{n-2}$, then the m-th row ends with an L. If $S_{n-1} + S_{n-2} \le m < S_{n-1} + S_{n-2} + S_{n-4}$, then the row ends with a G. If $S_{n-1} + S_{n-2} + S_{n-4} \le m < S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$, then the row ends with a P.

This leads one to the expectation that the canonical decomposition of m ($1 \le m < S_n$) determines explicitly the sequence of symbols that occur in the m-th row. To show this, we proceed inductively as follows.


(i) If the last symbol is D, that is $m < S_{n-1}$, then the penultimate symbol of the m-th row is the last symbol of the m-th row in the prastara of value n − 1, where the last row has n − 1 drutas.

(ii) If the m-th row ends with an L, that is $S_{n-1} \le m < S_{n-1} + S_{n-2}$, then the penultimate symbol of the m-th row is the last symbol of the $(m - S_{n-1})$-th row of the prastara of value n − 2.

(iii) If the m-th row ends with a G, that is $S_{n-1} + S_{n-2} \le m < S_{n-1} + S_{n-2} + S_{n-4}$, then the penultimate symbol of the m-th row is the last symbol of the $(m - (S_{n-1} + S_{n-2}))$-th row of the prastara of value n − 4.

(iv) If the m-th row ends with a P, that is $S_{n-1} + S_{n-2} + S_{n-4} \le m < S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$, then the penultimate symbol of the m-th row is the last symbol of the $(m - (S_{n-1} + S_{n-2} + S_{n-4}))$-th row of the prastara of value n − 6.

The next or the second penultimate symbol is then determined in the same way. And so on. This essentially establishes the following relation between the canonical decomposition of the row number m and the sequence of symbols that occur in the row.

Proposition 4: Given any number $0 \le m < S_n$, let the corresponding canonical decomposition be obtained and re-expressed as a binary sequence of 0's and 1's in the manner described earlier. Then, the symbols in the m-th row (enumerated from below starting from the last row numbered as 0) in the prastara of value n are as follows:

(i) If the sequence ends with (0, 0), then the last symbol is a D and the next symbol is the last symbol of the row whose number is given by the truncated binary sequence with the last 0 removed, in the prastara of value n − 1.

(ii) If the sequence ends with (0, 1, 0), then the last symbol is an L and the next symbol is the last symbol of the row whose number is given by the truncated binary sequence with the end portion (1, 0) removed, in the prastara of value n − 2.

(iii) If the sequence ends with (0, 0, 1, 1, 0), then the last symbol is a G and the next symbol is the last symbol of the row whose number is given by the truncated binary sequence with the end portion (0, 1, 1, 0) removed, in the prastara of value n − 4.

(iv) If the sequence ends with (0, 0, 1, 0, 1, 1, 0), then the last symbol is a P and the next symbol is the last symbol of the row whose number is given by the truncated binary sequence with the end portion (0, 1, 0, 1, 1, 0) removed, in the prastara of value n − 6.


We now look at the converse process of uddista, where the objective is to determine the row-number (in the enumeration given above) given the sequence of symbols that occur in some row of the prastara of value n. Suppose the row-number is m and we want to determine m. Suppose the last symbol is D; then $m < S_{n-1}$ and therefore $S_{n-1}$ does not occur in the canonical decomposition of m. If we delete the last D, we obtain a row of value n − 1, whose row-number t in the prastara (of value n − 1) can be determined by induction, and we have m = t. Suppose the last symbol is L; then $S_{n-1}$ occurs in the decomposition for m, and after deleting L, we get a row of value n − 2, whose row-number t in the prastara (of value n − 2) can be determined by induction on n. We then set $m = S_{n-1} + t$. If the last symbol of the m-th row is G, then $S_{n-1}$ and $S_{n-2}$ occur in the decomposition of m, and if we delete this G, we obtain a row of value n − 4, whose row-number t in the prastara (of value n − 4) can be determined by induction, and we set $m = S_{n-1} + S_{n-2} + t$. If the last symbol of the m-th row is P, then $S_{n-1}$, $S_{n-2}$ and $S_{n-4}$ occur in the decomposition of m, and if we delete this P, we obtain a row of value n − 6, whose row-number t in the prastara (of value n − 6) can be determined by induction, and then we set $m = S_{n-1} + S_{n-2} + S_{n-4} + t$. This is the process of uddista.
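The two inductive procedures of this section can be sketched in Python as follows. The sketch is an illustration, not Sarngadeva's own formulation: rows are numbered from below starting with 0, as above, and the conventions $S_0 = 1$ and $S_k = 0$ for negative k are assumptions that let the thresholds be used for small n as well. The names `samkhya`, `nasta` and `uddista` are ours.

```python
from functools import lru_cache

DURATIONS = (("D", 1), ("L", 2), ("G", 4), ("P", 6))

@lru_cache(maxsize=None)
def samkhya(n):
    """S_n, with the conventions S_0 = 1 and S_k = 0 for k < 0."""
    if n < 0:
        return 0
    if n == 0:
        return 1
    return sum(samkhya(n - duration) for _, duration in DURATIONS)

def nasta(m, n):
    """Row number m (counted from below, last row = 0) of the n-druta-prastara."""
    if not 0 <= m < samkhya(n):
        raise ValueError("row number out of range")
    symbols = []                                  # collected from the last symbol backwards
    while n > 0:
        for symbol, duration in DURATIONS:
            block = samkhya(n - duration)         # number of rows ending in `symbol`
            if m < block:
                symbols.append(symbol)
                n -= duration
                break
            m -= block
    return "".join(reversed(symbols))

def uddista(row):
    """Row number (counted from below) of `row` in its prastara."""
    n = sum(dict(DURATIONS)[symbol] for symbol in row)
    m = 0
    for symbol in reversed(row):
        for other, duration in DURATIONS:
            if other == symbol:
                n -= duration
                break
            m += samkhya(n - duration)            # skip the blocks lying below
    return m
```

For the prastara of value 4, `nasta(5, 4)` is G and `nasta(0, 4)` is DDDD; in the usual enumeration from the top these are rows $S_4 - 5 = 1$ and $S_4 - 0 = 6$. Conversely `uddista("DLD")` gives 2.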

We have outlined the theory of the canonical representation of a number as a sum of Sarngadeva numbers (samkhyankas $S_n$), which constitutes the mathematical basis for the nasta and uddista processes discussed by Sarngadeva. We shall now outline the processes of nasta and uddista as described by Sarngadeva himself. His rule of enumeration of tala-prastara also has an elegant mathematical description if viewed in terms of the canonical decompositions associated with two successive row-numbers. We shall discuss this in Appendix A.


3.9. Nasta process as discussed by Sarngadeva 


[Sanskrit verses: Sangitaratnakara, Chapter V]


The nasta process is outlined by Sarngadeva in the above verses, which may
be paraphrased as follows: In order to find the tala-form which is associated
with the row-number $r$ (nastanka) of an $n$-druta-prastara, write the sequence of
samkhyankas $S_1, S_2, \ldots, S_n$. Subtract the rank-number from $S_n$. From the result
($S_n - r$), subtract $S_{n-1}$ if possible. If $S_{n-1}$ can be subtracted, it is called
reducible (patita), and then subtract $S_{n-2}$ (if possible) from the result
($S_n - r - S_{n-1}$) of this subtraction. If $S_{n-1}$ cannot be subtracted from
$S_n - r$, it is called irreducible (apatita) and then subtract $S_{n-2}$ (if possible)
from the number $S_n - r$. And so on.

Mark all the samkhyankas $S_1, S_2, \ldots, S_{n-1}$ as either patita or apatita. $S_n$,
being larger than $S_n - r$, is always apatita. Scanning the samkhyanka sequence
right to left from $S_n$, obtain the successive talangas of the tala-form from
right to left, according to the following procedure.


• We start with $S_n$, which is apatita. Then there are the following possibilities:


    • If the following samkhyanka $S_{n-1}$ is also apatita, then write a D
      corresponding to $S_n$ and proceed with scanning from $S_{n-1}$.


    • If $S_{n-1}$ is patita and $S_{n-2}$ is apatita, then write an L corresponding to
      the two samkhyankas $S_n$ and $S_{n-1}$ taken together, and proceed to scan from
      $S_{n-2}$.


    • If the samkhyankas $S_{n-1}$, $S_{n-2}$ are patita and $S_{n-3}$ and $S_{n-4}$ are
      apatita, then write a G corresponding to the four samkhyankas $S_n$, $S_{n-1}$,
      $S_{n-2}$ and $S_{n-3}$ taken together, and start scanning from $S_{n-4}$.


    • If the samkhyankas $S_{n-1}$, $S_{n-2}$, $S_{n-4}$ are all patita and $S_{n-3}$,
      $S_{n-5}$ and $S_{n-6}$ are apatita, then mark a P corresponding to the six
      samkhyankas $S_n$, $S_{n-1}$, $S_{n-2}$, $S_{n-3}$, $S_{n-4}$ and $S_{n-5}$ taken
      together, and start scanning from $S_{n-6}$.


• Given the way the samkhyankas are generated, the above four cases exhaust
  all the possibilities.


• Proceeding thus, mark all the talangas from right to left.
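
The subtraction and scanning steps above can be sketched as follows. This is a hedged illustration rather than a transcription of Sarngadeva's text: the function names `samkhyankas` and `nasta` are hypothetical, $S_0 = 1$ is assumed as the seed of the samkhyanka recurrence, and the code relies on the claim that the four cases exhaust all possibilities (it does not re-check that $S_{n-3}$ is apatita before emitting a G or a P).

```python
def samkhyankas(n):
    """Return [S_0, S_1, ..., S_n], assuming S_0 = 1 and S_i = 0 for i < 0."""
    s = [1]
    for i in range(1, n + 1):
        s.append(sum(s[i - d] for d in (1, 2, 4, 6) if i - d >= 0))
    return s


def nasta(n, r):
    """Tala-form in row r (counted from 1 at the top) of the n-druta-prastara."""
    s = samkhyankas(n)
    if not 1 <= r <= s[n]:
        raise ValueError("rank must lie between 1 and S_n")
    rest = s[n] - r
    patita = [False] * (n + 1)            # patita[i] refers to S_i
    for i in range(n - 1, 0, -1):         # try S_{n-1}, S_{n-2}, ..., S_1
        if s[i] <= rest:
            patita[i] = True
            rest -= s[i]

    def is_patita(j):
        return j >= 1 and patita[j]

    form = []
    i = n                                 # scan right to left from S_n
    while i >= 1:
        if not is_patita(i - 1):
            symbol, width = "D", 1
        elif not is_patita(i - 2):
            symbol, width = "L", 2
        elif not is_patita(i - 4):
            symbol, width = "G", 4
        else:
            symbol, width = "P", 6
        form.append(symbol)
        i -= width
    return "".join(reversed(form))        # talangas were found right to left
```

With these assumptions, `nasta(7, 28)` yields `"GDDD"`, in agreement with the worked example for the 28th tala-form of the 7-druta-prastara.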


In the above process of nasta for finding the tala-form associated with row $r$ in
an $n$-druta-prastara, Sarngadeva essentially gives the procedure for constructing
what we have described as the canonical decomposition of the number $S_n - r$.
If we mark the patita samkhyankas by 1 and the apatita by 0, then following
the procedure of Sarngadeva, we obtain the binary sequence that is associated
with the number $S_n - r$ via its canonical decomposition. Then Sarngadeva's
prescription for identifying the talangas that occur in the tala-form appearing
in the $r$-th row is essentially the same as what we have stated as Proposition 4
above (except for the fact that we have used an enumeration of the rows where
the rows are numbered from 0 from below, while Sarngadeva uses the usual
enumeration of rows starting with row number 1 at the top of the prastara).

The signatures of the various talangas as given by Sarngadeva are shown in
Table 11. Here, "0" stands for apatita and "1" for patita. Further, "(0)" (zero
enclosed by brackets) indicates that the previous samkhyanka (if any) has to be
an apatita for the identification to take effect.


• Starting from 3, since 2 is apatita, we get a druta to the left.
• Starting from 2, since 1 is apatita, we get one more druta to the left.
• Since 1 is apatita, we get one more druta.

Thus the tala-form is DDDLL.

The 28th tala-form in the 7-druta-prastara

33, 19, 10 and 6 are apatita and thus we get DDD from the right. Starting from
6, 3 and 2 are patita and 1 is apatita. They give a G, thus giving the tala-form
GDDD.


    | 0 | 1 | 1 | 0 | 0  | 0  | 0  |
    | 1 | 2 | 3 | 6 | 10 | 19 | 33 |


3.10. Uddista process as discussed by Sarngadeva 


[Sanskrit verses: Sangitaratnakara, Chapter V]


Given a tala-form associated with an $n$-druta-prastara, to find its rank-number
the uddista process is employed. In the above verses, Sarngadeva essentially
states that since uddista is the reverse of the nasta process, we write the sequence
of samkhyankas and identify the patitas among them which give rise to the given
sequence of talangas. This effectively leads to the following process, which has
also been discussed earlier in the general mathematical context.

We write the samkhyankas $1, 2, \ldots, S_n$ above the talangas from the left. We
write one samkhyanka above a D, two above an L, four above a G and six above
a P. Since the samkhyanka associated with a D is apatita, it is not marked. Since
the first samkhyanka associated with an L is patita and the second is apatita,
only the first is marked. Since only the second and third of the samkhyankas
associated with a G are patita, they are marked. Since only the second, fourth and
fifth samkhyankas associated with a P are patita, they are marked. The sum of
the patita or marked samkhyankas is subtracted from $S_n$ to get the rank-number
of the tala-form.
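
The marking rule just stated translates almost directly into code. The following is a minimal sketch under stated assumptions: the table `MARKS` simply encodes which of the samkhyankas above each talanga are patita (1) or apatita (0) as described in the paragraph above, while the names `MARKS`, `samkhyankas` and `uddista` are hypothetical and $S_0 = 1$ is assumed as the seed of the recurrence.

```python
# 1 = patita (marked), 0 = apatita, read left to right above each talanga
MARKS = {
    "D": (0,),
    "L": (1, 0),
    "G": (0, 1, 1, 0),
    "P": (0, 1, 0, 1, 1, 0),
}


def samkhyankas(n):
    """Return [S_0, S_1, ..., S_n], assuming S_0 = 1 and S_i = 0 for i < 0."""
    s = [1]
    for i in range(1, n + 1):
        s.append(sum(s[i - d] for d in (1, 2, 4, 6) if i - d >= 0))
    return s


def uddista(form):
    """Rank (counted from 1 at the top) of a tala-form such as 'LDLL'."""
    marks = [bit for c in form for bit in MARKS[c]]
    n = len(marks)                        # value of the prastara in drutas
    s = samkhyankas(n)
    patita_total = sum(s[i + 1] for i, bit in enumerate(marks) if bit)
    return s[n] - patita_total
```

For the tala-forms LDLL and GDL in the 7-druta-prastara this sketch returns 7 and 9 respectively.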


Examples 


To find the rank of LDLL in 7-druta-prastara


    | 1 | 2 | 3 | 6 | 10 | 19 | 33 |
    |   L   | D |   L    |    L    |

Total of the patita samkhyankas: 19 + 6 + 1 = 26. Rank: $33 - 26 = 7$.


To find the rank of GDL in 7-druta-prastara


    | 1 | 2 | 3 | 6 | 10 | 19 | 33 |
    |       G       | D  |    L    |

Total of the patita samkhyankas: 19 + 3 + 2 = 24. Rank: $33 - 24 = 9$.


To find the rank of PD in 7-druta-prastara


    | 1 | 2 | 3 | 6 | 10 | 19 | 33 |
    |            P            | D  |

Total of the patita samkhyankas: 10 + 6 + 2 = 18. Rank: $33 - 18 = 15$.


3.11. Patala


[Sanskrit verses: Sangitaratnakara, Chapter V]


The patala sequence $T_1, T_2, \ldots, T_n, \ldots$ is defined inductively as follows: First
write 1 ($S_0 = 1$) followed by the sequence of samkhyankas $S_1, S_2, \ldots, S_n, \ldots$.
Below these the patala sequence is to be set down. The $n$-th patala number ($T_n$)
is obtained by adding the previous samkhyanka ($S_{n-1}$) to the previous,
second-previous, fourth-previous and the sixth-previous patala numbers whenever they
exist. That is,


$$T_n = S_{n-1} + T_{n-1} + T_{n-2} + T_{n-4} + T_{n-6} \qquad (20)$$


In the case of the patala sequence, there is no consideration of any representative in
case the fourth or sixth previous elements do not exist (which was done in the case
of the samkhyanka sequence).¹⁰

The patala numbers $T_n$, $T_{n-1}$, $T_{n-3}$ and $T_{n-5}$ give the total number of D, L, G
and P which occur in the (various tala-forms which are included in the) $n$-druta
prastara. We can easily deduce this fact from the inductive construction of the
prastara outlined earlier. For instance, let $D_n$ be the total number of drutas in the
$n$-druta prastara. The last $S_{n-1}$ rows, which end in a D, have $S_{n-1} + D_{n-1}$ drutas.
The number of drutas in the rows which end in L, G and P are $D_{n-2}$, $D_{n-4}$ and
$D_{n-6}$ respectively. Hence, $D_n$ satisfies the recurrence relation


$$D_n = S_{n-1} + D_{n-1} + D_{n-2} + D_{n-4} + D_{n-6}$$


which is the same as the recurrence relation (20) satisfied by the patala numbers
$T_n$. The equality of $D_n$ and $T_n$ follows from this and the fact that $D_i = T_i$ for
small values of $i$.

Further,


$$M_n = T_n + T_{n-1} + T_{n-3} + T_{n-5} = T_{n+1} - S_n \qquad (21)$$


gives what is sometimes called the maha-patala, which is the total number of all
talangas (D, L, G and P) which appear in the $n$-druta prastara.

In Table 12, we give the patala sequence. Here, $T_0 = 0$, $T_1 = 1$, $T_2 = 1 + 1 = 2$,
$T_3 = 2 + 2 + 1 = 5$, $T_4 = 3 + 5 + 2 = 10$, $T_5 = 6 + 10 + 5 + 1 = 22$,
$T_6 = 10 + 22 + 10 + 2 = 44$, $T_7 = 19 + 44 + 22 + 5 + 1 = 91$, and so on.
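
The recurrences (20) and (21) are easy to tabulate. The sketch below is illustrative only: the function name `patala_table` is hypothetical, and missing earlier terms are treated as 0 (with $S_0 = 1$ and $T_0 = 0$ as stated in the text).

```python
def patala_table(n_max):
    """Return lists (S, T, M) indexed 0..n_max, with S_0 = 1 and T_0 = 0."""
    S, T = [1], [0]
    for n in range(1, n_max + 1):
        # samkhyanka: previous, second-, fourth- and sixth-previous terms
        S.append(sum(S[n - d] for d in (1, 2, 4, 6) if n - d >= 0))
        # patala, relation (20): previous samkhyanka plus earlier patala numbers
        T.append(S[n - 1] + sum(T[n - d] for d in (1, 2, 4, 6) if n - d >= 0))
    # maha-patala, first form of relation (21)
    M = [T[n] + sum(T[n - d] for d in (1, 3, 5) if n - d >= 0)
         for n in range(n_max + 1)]
    return S, T, M
```

Calling `patala_table(7)` reproduces the values $T_1, \ldots, T_7$ = 1, 2, 5, 10, 22, 44, 91 worked out above, and the second form of (21), $M_n = T_{n+1} - S_n$, can be checked numerically on the returned lists.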


¹⁰ In other words, we set $T_i = 0$ for $i < 0$.


    n   |  1 |  2 |  3 |  4 |  5 |  6 |   7 |   8 |   9 |   10 |
    S_n |  1 |  2 |  3 |  6 | 10 | 19 |  33 |  60 | 106 |  191 |
    T_n |  1 |  2 |  5 | 10 | 22 | 44 |  91 | 180 | 358 |  698 |
    M_n |  1 |  3 |  7 | 16 | 34 | 72 | 147 | 298 | 592 | 1169 |


TABLE 12. Patala sequence


3.12. Druta-meru 


[Sanskrit verses: Sangitaratnakara, Chapter V]


After considering the pratyaya patala, Sarngadeva begins the discussion of the
pratyayas druta-meru, laghu-meru, guru-meru and pluta-meru, which are
analogous to the pratyaya eka-dvayadi-laga-kriya for matra-vrttas. The
druta-meru is to be constructed to compute the number of tala-forms in a prastara
which have a given number of drutas.

While considering an $n$-druta prastara, start with a first row which has $n$
entries. The row above will not have any entry in the first column, the successive
rows after that will have two columns less at each stage. In each row, the first two
entries from the left will be 1, 1. In the first row, the odd places are filled with the
sum of the previous, second-previous, fourth-previous and the sixth-previous
entries as in the case of samkhyankas. The even places are to be filled with the same
sum except for the previous entry. Where the fourth or sixth-previous entry is not
there, the third or the fifth-previous entry is to be used as the representative. In the
other rows, the odd places are filled with the sum of the previous, second-previous,
fourth-previous and the sixth-previous entries as in the case of samkhyankas. The
even places are to be filled with the same sum except that the previous entry is to
be taken from the row below. Where the fourth or sixth-previous entry is not there,
no representative is to be used in their place.


TABLE 13. Druta-meru


The different columns correspond to prastaras of different values which
are indicated in the bottom row. The entries in the successive rows above
the bottom row in the $n$-th column give the number of tala-forms which
have 0, 2, 4, etc., drutas respectively when $n$ is even, and the number of
tala-forms which have 1, 3, 5, etc., drutas respectively when $n$ is odd.

Then, the entries read from below in the $n$-th column give the number of
tala-forms in the $n$-druta prastara with 1, 3, 5, etc., drutas respectively, in the case
when $n$ is odd; they give the number of tala-forms in the $n$-druta prastara with
0, 2, 4, etc., drutas respectively, in the case when $n$ is even.¹¹ Notice that the sum
of all the entries in any column is always equal to the corresponding samkhyanka.

In Table 13, a bottom (or zeroth) row is introduced below the first row to denote
the total value of the prastara. The rows of the meru are filled as follows.


• First row odd entries: 1 = 1, 2 = 1+1, 5 = 2+2+1, 12 = 4+5+2+1,
  26 = 7+12+5+2

• First row even entries: 1 = 1, 2 = 1+1, 4 = 2+1+1, 7 = 4+2+1
  (in the second and third sums 1 is the pratinidhi or the representative)

• Second row odd entries: 4 = 3+1, 14 = 9+4+1, 44 = 25+14+4+1

• Second row even entries: 3 = 2+1, 9 = 5+3+1, 25 = 12+9+3+1


¹¹ Since L, G and P correspond to kala-pramanas 2, 4 and 6, which are all even, an $n$-druta
prastara will have rows which contain only an odd number of Ds when $n$ is odd, and only an even
number of Ds when $n$ is even.


For example, in the 5-druta prastara, there are 5 tala-forms with 1 druta, 4
with 3 drutas, and 1 with 5 drutas. In the 6-druta prastara, there are 4
tala-forms with 0 drutas, 9 with 2 drutas, 5 with 4 drutas and 1 with 6 drutas. In the
7-druta prastara, there are 12 tala-forms with 1 druta, 14 with 3 drutas, 6 with
5 drutas and 1 with 7 drutas.

The above construction of the druta-meru is in fact a generalisation of the
construction of the matra-meru used in the eka-dvayadi-laga-kriya for the
matra-vrttas in the Prakrta Paingala (in Prakrta, c. 1300) or the commentary Narayani
of Narayanabhatta (c. 1555) on Vrttaratnakara of Kedarabhatta. There, the
meru is constructed essentially by a recurrence relation of the type
$g^n_k = g^{n-1}_k + g^{n-2}_{k-1}$, where $g^n_k$ is the number of metrical forms in the
prastara of $n$ matras with $k$ gurus. These can be shown to be related to the binomial
coefficients and their sum for any given $n$ will be related to the samkhyankas or the
Virahanka-Fibonacci numbers.¹²


Let $D^n_k$ denote the number of tala-forms with $k$ drutas in the $n$-druta prastara.
If we refer back to the inductive construction of the prastara explained earlier, we
can easily see that of these tala-forms, $D^{n-1}_{k-1}$ will be the number of tala-forms
which end in a druta, while $D^{n-2}_k$, $D^{n-4}_k$ and $D^{n-6}_k$ will be the number of
tala-forms which end in a laghu, guru and pluta respectively. Thus we have the
recurrence relation:


$$D^n_k = D^{n-1}_{k-1} + D^{n-2}_k + D^{n-4}_k + D^{n-6}_k \quad (\text{for } n > 6 \text{ and } k > 0) \qquad (22)$$


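The above recurrence relation satisfied by $D^n_k$ is in fact the basis for the above
construction of the druta-meru. To obtain it from a generating function, we first note
that the coefficient of $x^n$ in the expansion of $[1 - (x + x^2 + x^4 + x^6)]^{-1}$ is the
number of possible tala-forms of value $n$. Keeping track of the drutas by an
additional variable $t$, we have


$$[1 - (tx + x^2 + x^4 + x^6)]^{-1} = \sum_n \sum_{k \le n} D^n_k \, t^k x^n \qquad (23)$$


We therefore have


$$[1 - (tx + x^2 + x^4 + x^6)] \sum_n \sum_{k \le n} D^n_k \, t^k x^n = 1 \qquad (24)$$


Comparing the coefficients of $t^k x^n$ on both sides of the above equation, we obtain
the recurrence relation (22). Incidentally, by setting $t = 1$ in (23), we obtain the
obvious relation


$$\sum_{k \le n} D^n_k = S_n \qquad (25)$$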


The recurrence relations (22) are not very transparent in the form in which the
druta-meru of Sarngadeva is laid out in Table 13, because the same row corresponds
to $k$ or $(k + 1)$ drutas in alternate columns. In Appendix C, we shall display an
alternative form of druta-meru proposed by Akella Mallikarjuna Sarma, which
clearly reflects the above recurrence relation.
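
As a rough numerical check of the recurrence (22), the counts $D^n_k$ can be tabulated directly. The sketch below is not Sarngadeva's layout of the meru: the function name `druta_counts` is hypothetical, and it assumes the base case $D^0_0 = 1$ (the empty form) with all out-of-range entries taken as 0, so that the recurrence can be applied for every $n \ge 1$.

```python
def druta_counts(n_max):
    """D[n][k] = number of tala-forms of value n containing exactly k drutas."""
    D = [[0] * (n_max + 1) for _ in range(n_max + 1)]
    D[0][0] = 1                                   # assumed base case: empty form
    for n in range(1, n_max + 1):
        for k in range(n + 1):
            total = D[n - 1][k - 1] if k >= 1 else 0   # forms ending in a druta
            for d in (2, 4, 6):                        # ending in L, G or P
                if n - d >= 0:
                    total += D[n - d][k]
            D[n][k] = total
    return D
```

With `D = druta_counts(10)`, the entries `D[7][1]`, `D[7][3]`, `D[7][5]`, `D[7][7]` come out as 12, 14, 6 and 1, matching the example for the 7-druta prastara, and `sum(D[7])` equals the samkhyanka 33 in accordance with (25). The first row of the meru in Sarngadeva's arrangement corresponds to `D[n][n % 2]` for successive $n$.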


3.13. Laghu-meru 


[Sanskrit verses: Sangitaratnakara, Chapter V]


The laghu-meru is to be constructed to compute the number of tala-forms in a 
prastara which have a given number of laghus. 

Like in the druta-meru, while considering an $n$-druta prastara, start with a
row which has $n$ entries. The row above will not have any entry in the first column,
the successive rows after that will have two columns less at each stage. In each
row, the first entry from the left will be 1. In the first row, the places are filled
with the sum of the previous, the fourth-previous and the sixth-previous entries
in the same row. Where the fourth or sixth-previous entry is not there, the third
or the fifth-previous entry is to be used as the representative. In the other rows,
the places are filled with the sum of the previous, fourth-previous and the
sixth-previous entries in the same row, and the second-previous entry in the row below.
Where the fourth or sixth-previous entry is not there, no representative is to be
used in their place.


    |   |   |   |   |   |   |    |    |    |  1 |
    |   |   |   |   |   |   |    |  1 |  5 | 15 |
    |   |   |   |   |   | 1 |  4 | 10 | 20 | 39 |
    |   |   |   | 1 | 3 | 6 | 10 | 18 | 33 | 61 |
    |   | 1 | 2 | 3 | 4 | 7 | 12 | 21 | 34 | 54 |
    | 1 | 1 | 1 | 2 | 3 | 5 |  7 | 10 | 14 | 21 |
    | 1 | 2 | 3 | 4 | 5 | 6 |  7 |  8 |  9 | 10 |


TABLE 14. Laghu-meru


The different columns correspond to prastaras of different values which
are indicated in the bottom row. The entries in the successive rows above
the bottom row in the $n$-th column give the number of tala-forms which
have 0, 1, 2 etc., laghus respectively.


Then, the entries read from the bottom in the $n$-th column give the number
of tala-forms in the $n$-druta prastara with 0, 1, 2, 3, etc., laghus respectively.
The sum of all the entries in any column is always equal to the corresponding
samkhyanka.

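In Table 14, a bottom (or zeroth) row is introduced below the first row to denote
the total value of the prastara. The rows of the meru are filled as follows.

First row: 1 = 1, 1 = 1, 1 = 1, 2 = 1+1, 3 = 2+1, 5 = 3+1+1, 7 = 5+1+1,
10 = 7+2+1, 14 = 10+3+1, 21 = 14+5+2 (in the fourth and sixth sums 1 is the
pratinidhi).

Second row: 2 = 1+1, 3 = 2+1, 4 = 3+1, 7 = 4+2+1, 12 = 7+3+2,
21 = 12+5+3+1, 34 = 21+7+4+2, 54 = 34+10+7+3.

For example, in the 6-druta prastara, there are 5 tala-forms with 0 laghus, 7
with 1 laghu, 6 with 2 laghus and 1 with 3 laghus. In the 7-druta prastara, there
are 7 tala-forms with 0 laghus, 12 with 1 laghu, 10 with 2 laghus and 4 with 3
laghus.

If $L^n_k$ denotes the number of tala-forms with $k$ laghus in the $n$-druta prastara,
then we can easily see that we have the following relation analogous to (23):


$$[1 - (x + tx^2 + x^4 + x^6)]^{-1} = \sum_n \sum_{k \le n} L^n_k \, t^k x^n \qquad (26)$$


From (26) we obtain the following recurrence relation


$$L^n_k = L^{n-1}_k + L^{n-2}_{k-1} + L^{n-4}_k + L^{n-6}_k \quad (\text{for } n > 6 \text{ and } k > 0) \qquad (27)$$


which is clearly the basis for the construction of the laghu-meru as described
above.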
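
The row-filling rule for the laghu-meru, including the use of a representative in the first row only, can be followed literally in code. This is a sketch under stated assumptions rather than a definitive rendering of the verses: the function name `laghu_meru` and the dictionary-per-row layout (column number mapped to entry) are hypothetical choices, and a missing entry with no representative is simply skipped.

```python
def laghu_meru(n_max):
    """Rows of the laghu-meru; rows[j][c] = number of forms of value c with j laghus."""
    rows = []
    j = 0
    while True:
        start = 1 if j == 0 else 2 * j     # first column that has an entry
        if start > n_max:
            break
        row = {start: 1}                   # first entry of every row is 1
        for c in range(start + 1, n_max + 1):
            total = row[c - 1]             # previous entry in the same row
            for d in (4, 6):               # fourth- and sixth-previous entries
                if c - d in row:
                    total += row[c - d]
                elif j == 0 and c - d + 1 in row:
                    total += row[c - d + 1]    # representative, first row only
            if j > 0:
                total += rows[j - 1].get(c - 2, 0)   # second-previous, row below
            row[c] = total
        rows.append(row)
        j += 1
    return rows
```

Reading column 7 of `laghu_meru(10)` from the bottom gives 7, 12, 10 and 4, the counts of tala-forms with 0, 1, 2 and 3 laghus in the 7-druta prastara, and these add up to the samkhyanka 33.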


3.14. Guru-meru 


[Sanskrit verses: Sangitaratnakara, Chapter V]


The guru-meru is to be constructed to compute the number of tala-forms in a
prastara which have a given number of gurus.

While considering an n-druta prastara, start with a row which has n entries. 
The row above will have three columns less and the subsequent rows will have four 
columns less at each stage. In each row, the first entry from the left will be 1. In the 
first row, place 2 at the second place. The other places are filled with the sum of the 
previous, second-previous and the sixth-previous entries. When the sixth-previous 
entry is not there, the fifth-previous entry is to be used as the representative. In 
the other rows, the places are filled with the sum of the previous, second-previous 
and the sixth-previous entries in the same row, and the fourth-previous entry in the 
row below. Where the sixth-previous entry is not there, no representative is to be 
used in its place. 

Then, the entries read from the bottom in the n-th column give the number 
of tala-forms in the n-druta prastara with 0, 1, 2, 3, etc., gurus respectively. 
The sum of all the entries in any column is always equal to the corresponding 
samkhyanka. 

In Table 15, a bottom (or zeroth) row is introduced below the first row to denote 
the total value of the prastara. The rows of the meru are filled as follows: 

First row: 1 = 1, 2 = 2, 3 = 2+1, 5 = 3+2, 8 = 5+3, 14 = 8+5+1,
23 = 14+8+1, 39 = 23+14+2, 65 = 39+23+3, 109 = 65+39+5
(in the sixth entry 1 is the pratinidhi).

Second row: 2 = 1+1, 5 = 2+1+2, 10 = 5+2+3, 20 = 10+5+5,
38 = 20+10+8, 73 = 38+20+14+1.

For example, in the 6-druta prastara, there are 14 tala-forms with 0 gurus and
5 with 1 guru. In the 9-druta prastara, there are 65 tala-forms with 0 gurus, 38
with 1 guru and 3 with 2 gurus.


    |   |   |   |   |   |    |    |  1 |  3 |   9 |
    |   |   |   | 1 | 2 |  5 | 10 | 20 | 38 |  73 |
    | 1 | 2 | 3 | 5 | 8 | 14 | 23 | 39 | 65 | 109 |
    | 1 | 2 | 3 | 4 | 5 |  6 |  7 |  8 |  9 |  10 |


TABLE 15. Guru-meru


The different columns correspond to prastaras of different values which
are indicated in the bottom row. The entries in the successive rows above
the bottom row in the $n$-th column give the number of tala-forms which
have 0, 1, 2 etc., gurus respectively.


If $G^n_k$ denotes the number of tala-forms with $k$ gurus in the $n$-druta prastara,
then we can easily see that we have the following relation analogous to (23):


$$[1 - (x + x^2 + tx^4 + x^6)]^{-1} = \sum_n \sum_{k \le n} G^n_k \, t^k x^n \qquad (28)$$


From (28) we obtain the following recurrence relation


$$G^n_k = G^{n-1}_k + G^{n-2}_k + G^{n-4}_{k-1} + G^{n-6}_k \quad (\text{for } n > 6 \text{ and } k > 0) \qquad (29)$$


which is clearly the basis for the construction of the guru-meru as described
above.
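
The counts produced by the merus can be cross-checked independently by brute force, that is, by listing every tala-form of a given value and counting the occurrences of one symbol. The sketch below is such a check and is not the meru construction itself; the names `tala_forms` and `count_by` are hypothetical, and the order in which the forms are generated is not claimed to be the order of the prastara.

```python
from collections import Counter

VALUE = {"D": 1, "L": 2, "G": 4, "P": 6}  # value of each talanga in drutas


def tala_forms(n):
    """Yield every tala-form (a string over D, L, G, P) of total value n."""
    if n == 0:
        yield ""
        return
    for symbol, value in VALUE.items():
        if value <= n:
            for rest in tala_forms(n - value):
                yield rest + symbol       # 'symbol' is the last talanga


def count_by(symbol, n):
    """Map k -> number of tala-forms of value n with exactly k copies of symbol."""
    return Counter(form.count(symbol) for form in tala_forms(n))
```

For example, `count_by("G", 9)` gives 65 forms with no guru, 38 with one guru and 3 with two gurus, in agreement with the ninth column of the guru-meru.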


3.15. Pluta-meru


[...] Where the fourth-previous entry is not there, the third-previous entry is to be
used as the representative. In the other rows, the places are filled with the sum of the
previous, second-previous, and the fourth-previous entries in the same row, and
the sixth-previous entry in the row below. Where the fourth-previous entry is not
there, no representative is to be used in its place. The sum of all the entries in any
column is always equal to the corresponding samkhyanka.


    |   |   |   |   |    |  1 |  2 |  5 | 10 |  22 |  44 |
    | 1 | 2 | 3 | 6 | 10 | 18 | 31 | 55 | 96 | 169 | 296 |
    | 1 | 2 | 3 | 4 |  5 |  6 |  7 |  8 |  9 |  10 |  11 |


TABLE 16. Pluta-meru 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., plutas respectively. 


In Table 16, a bottom (or zeroth) row is introduced below the first row to denote 
the total value of the prastara. The rows of the meru are filled as follows. 

First row: 1 = 1, 2 = 2, 3 = 2+1, 6 = 3+2+1, 10 = 6+3+1, 18 = 10+6+2,
31 = 18+10+3, 55 = 31+18+6, 96 = 55+31+10, 169 = 96+55+18,
296 = 169+96+31 (in the fourth entry 1 is the pratinidhi).

Second row: 2 = 1+1, 5 = 2+1+2, 10 = 5+2+3, 22 = 10+5+1+6,
44 = 22+10+2+10.

For example, in the 6-druta prastara, there are 18 tala-forms with 0 plutas and
1 with 1 pluta. In the 9-druta prastara, there are 96 tala-forms with 0 plutas, and
10 with 1 pluta.

If $P^n_k$ denotes the number of tala-forms with $k$ plutas in the $n$-druta prastara,
then we can easily see that we have the following relation analogous to (23):


$$[1 - (x + x^2 + x^4 + tx^6)]^{-1} = \sum_n \sum_{k \le n} P^n_k \, t^k x^n \qquad (30)$$


From (30) we obtain the following recurrence relation


$$P^n_k = P^{n-1}_k + P^{n-2}_k + P^{n-4}_k + P^{n-6}_{k-1} \quad (\text{for } n > 6 \text{ and } k > 0) \qquad (31)$$


which is clearly the basis for the construction of the pluta-meru as described
above.


3.16. Other Pratyayas


After the discussion of the druta, laghu, guru and pluta-merus, Sarngadeva goes
on to consider ten more pratyayas, namely the samyoga-meru, khanda-prastara
and the nasta and uddista processes associated with the druta, laghu, guru and
pluta-merus. We shall here merely indicate the nature of these pratyayas, which
have been dealt with in considerable detail by Sarngadeva in over fifty verses
(verses 357-409) of Chapter V of Sangitaratnakara.

The samyoga-meru gives the number of tala-forms in a prastara which have
only D, or only L, or only G, or only P, or only D and L, or only D and G, and
so on. This involves a consideration of all the 15 such combinations involving D,
L, G and P, taken individually, in pairs, in triples and all the four together.

The next pratyaya, khanda-prastara, is in fact a sub-prastara of a given
$n$-druta prastara, and has only those tala-forms which conform to a given
specification, such as having no drutas, or one druta, or two drutas, etc., and similarly for
laghus, gurus and plutas. Finally, the processes of nasta and uddista, for such
a khanda-prastara, are discussed by Sarngadeva in terms of the corresponding
merus.


Appendices


Appendix A. Canonical decomposition and the rule of enumeration of 
Tala-prastara 


In this Appendix, we shall explain how Sarngadeva's rule of enumeration of the
tala-prastara has an elegant mathematical description if viewed in terms of what
we have referred to as the canonical decomposition of a row number as a sum of
the samkhyankas $S_n$ or Sarngadeva numbers. We have already noted earlier (in
our discussion of the nasta process) that the $m$-th row in the $n$-druta-prastara is
obtained by taking the canonical decomposition of $S_n - m$, writing 1 for all the
patita numbers ($< S_n$) and 0 for the others, substituting P for the string 010110,
G for 0110, L for 10 and D for the string consisting of the single symbol 0. The
next lemma shows that for $m = 1$, that is, for the canonical decomposition of
$S_n - 1$, this actually leads to the first row of the prastara as described by
Sarngadeva.


Lemma: The canonical decomposition of $S_n - 1$ leads via the process of nasta
to the first row of the $n$-druta-prastara, which has the maximum number of Ps at
the extreme right, preceded on the left by a D, L, DL, G or DG according as
$n \equiv 1, 2, 3, 4$ or $5$ (modulo 6). The first row has only Ps if $n \equiv 0$
(modulo 6).


Proof: If $n = 1$, $S_n - 1 = 0$ and the first row of the 1-druta-prastara is just D.
If $n = 2$, $S_n - 1 = 1 = 1 \cdot S_1 + 0 \cdot S_2$ and hence the first row of the
2-druta-prastara corresponds to the sequence 10 and is therefore an L. If $n = 3$,
$S_n - 1 = 2 = 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3$, hence the first row of the
3-druta-prastara corresponds to the sequence 010 and is hence DL. If $n = 4$,
$S_n - 1 = 6 - 1 = 5 = 0 \cdot S_1 + 1 \cdot S_2 + 1 \cdot S_3 + 0 \cdot S_4$, hence the first
row of the 4-druta-prastara corresponds to the sequence 0110 and is hence G. If
$n = 5$, $S_n - 1 = 10 - 1 = 9 = 0 \cdot S_1 + 0 \cdot S_2 + 1 \cdot S_3 + 1 \cdot S_4 + 0 \cdot S_5$,
hence the first row of the 5-druta-prastara corresponds to the sequence 00110 and is
hence DG. If $n = 6$,
$S_n - 1 = 19 - 1 = 18 = 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6$,
hence the first row of the 6-druta-prastara corresponds to the sequence 010110
and is hence P.

Let now $n > 6$. We have $S_n = S_{n-6} + S_{n-4} + S_{n-2} + S_{n-1}$, so that


$$S_n - 1 = (S_{n-6} - 1) + S_{n-4} + S_{n-2} + S_{n-1} \qquad (32)$$


with $n - 6 > 0$. We may assume by induction that the canonical decomposition
for $S_{n-6} - 1$ leads to the first row of the $(n - 6)$-druta-prastara. In view
of the above equation, the row corresponding to the canonical decomposition of
$S_n - 1$ is obtained by adding only one more P to the right of the first row of the
$(n - 6)$-druta-prastara and our lemma is thus proved.¹⁴
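
The statement of the lemma can be checked numerically for small $n$. The sketch below builds the first row in the shape described by the lemma and verifies that the samkhyankas marked as patita add up to $S_n - 1$. It checks only this sum, not the uniqueness of the canonical decomposition; the names `first_row`, `marked_sum`, `MARKS` and `PREFIX` are hypothetical, and $S_0 = 1$ is assumed as the seed of the recurrence.

```python
# patita (1) / apatita (0) pattern of the samkhyankas above each talanga
MARKS = {"D": (0,), "L": (1, 0), "G": (0, 1, 1, 0), "P": (0, 1, 0, 1, 1, 0)}
# leftmost talangas of the first row, indexed by n modulo 6
PREFIX = {0: "", 1: "D", 2: "L", 3: "DL", 4: "G", 5: "DG"}


def samkhyankas(n):
    """Return [S_0, S_1, ..., S_n], assuming S_0 = 1 and S_i = 0 for i < 0."""
    s = [1]
    for i in range(1, n + 1):
        s.append(sum(s[i - d] for d in (1, 2, 4, 6) if i - d >= 0))
    return s


def first_row(n):
    """First row of the n-druta-prastara in the form given by the lemma."""
    q, r = divmod(n, 6)
    return PREFIX[r] + "P" * q


def marked_sum(form):
    """Sum of the samkhyankas that are marked patita for the given tala-form."""
    marks = [bit for c in form for bit in MARKS[c]]
    s = samkhyankas(len(marks))
    return sum(s[i + 1] for i, bit in enumerate(marks) if bit)
```

For $n = 7$ this gives the form DP, whose marked samkhyankas 3, 10 and 19 add up to $32 = S_7 - 1$.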

We shall now see that the above lemma can be used to relate the rule of
Sarngadeva for passing from the $m$-th row of the $n$-druta-prastara to the $(m + 1)$-th
row, via the change in the canonical decomposition of $S_n - m$ to that of
$S_n - (m + 1)$.

Suppose that the first non-druta element (from the left) of the $m$-th row is a P.
Then the canonical decomposition of $S_n - m$ has the binary form 00...010110....
Let us assume that the first 1 in the above string occurs at the $k$-th place (counted
from the left), i.e., that the canonical decomposition of $S_n - m$ begins with an $S_k$.
Therefore, in the canonical decomposition of $S_n - (m + 1)$, the 1 in the $k$-th position
in the canonical decomposition of $S_n - m$ changes to a zero and, in order to obtain
the canonical decomposition of $S_n - (m + 1)$, we need to know the canonical
decomposition of $S_k - 1$, which has been obtained in the above lemma. Thus we
see that while going from the $m$-th to the $(m + 1)$-th row, the P (above) changes to
a G and we have to insert, from right to left, as many Ps as possible followed by
DG, or G or DL or L or D or none of them as the case may be. The symbols
to the right of P in the $m$-th row remain unchanged in the $(m + 1)$-th row. This
is precisely the rule given by Sarngadeva for going from a given row to the next
in the prastara, when the first non-druta element in the row (going from left to
right) happens to be P. If the first non-druta element in the $m$-th row of a prastara
happens to be G or L, we can use a similar argument to show that in the next row
this symbol is changed to L or D respectively, and the rest of the symbols of the
row on the left are obtained exactly as given by the rule of Sarngadeva, and those
on the right remain the same.

Let us illustrate this by an example. The 15-th row (from the top) of the
7-druta prastara with 33 rows (see Table 6) is PD and corresponds to the canonical
decomposition


$$S_7 - 15 = 33 - 15 = 18 = 2 + 6 + 10$$
$$= 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$


This corresponds to the 7-tuple (0, 1, 0, 1, 1, 0, 0). The next row is associated with
the canonical decomposition


$$S_7 - 16 = 33 - 16 = 17 = 1 + 6 + 10$$
$$= (S_2 - 1) + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$
$$= 1 \cdot S_1 + 0 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$


This corresponds to the 7-tuple (1, 0, 0, 1, 1, 0, 0) and represents the row LGD
(see Table 6). That is, P is replaced by LG and the symbol D to the right of P
remains unchanged.


¹⁴ We may note that, in the case of matra-vrttas, the above lemma which gives the first row of a
prastara of value $n$ translates itself into the following simple identities between Virahanka-Fibonacci
numbers (see the article of R. Sridharan, cited earlier, p. 133):


$$S_1 + S_3 + \ldots + S_{2k-1} = S_{2k} - 1 \quad (\text{if } n = 2k)$$
$$S_2 + S_4 + \ldots + S_{2k} = S_{2k+1} - 1 \quad (\text{if } n = 2k + 1)$$

In Table 17, we have listed the 33 binary sequences of length 7 that correspond
to the rows of the 7-druta-prastara. We note that the last row, whose entries are all
0, corresponds to the canonical decomposition of 0. The row above it, (1, 0, ..., 0),
is obtained by increasing the first zero of the last row by 1. This corresponds to
replacing $S_1 - 1 = 0$ by $S_1 = 1$. In fact, in passing from the $m$-th row to the
$(m + 1)$-th row from below, a certain zero entry becomes 1, the rest of the entries to
the left become 0 and the entries to the right are unchanged. This, indeed, corresponds
to replacing the canonical decomposition of $S_k - 1$ for some $k < n$ (in an $n$-druta
prastara), by $S_k$.

For instance, in Table 17 (for $n = 7$), in the passage from the 27th row
(counted from below), which corresponds to the binary sequence (0, 1, 0, 1, 0, 1, 0),
to the 28th, the sequence in question gets replaced by (0, 0, 1, 1, 0, 1, 0), which
corresponds to the replacement of $S_3 - 1$ by $S_3$. In the same way, in passing
from the 15th row (counted from below), which corresponds to the binary
sequence (0, 1, 1, 0, 1, 0, 0), to the 16th, the above sequence gets replaced by
(0, 0, 0, 1, 1, 0, 0), which corresponds in turn to the replacement of $S_4 - 1$ by
$S_4$.
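
The passage from one binary sequence to the next can be checked mechanically. The following sketch assumes that the canonical decomposition is the one obtained by the greedy subtraction used in the nasta process; the names `canonical` and `changed_place` are hypothetical, and $S_0 = 1$ is assumed as the seed of the recurrence. The internal assertions express the rule stated above: one zero becomes 1, the entries to its right are unchanged, and the entries to its left, which represented $S_k - 1$, become 0.

```python
def samkhyankas(n):
    """Return [S_0, S_1, ..., S_n], assuming S_0 = 1 and S_i = 0 for i < 0."""
    s = [1]
    for i in range(1, n + 1):
        s.append(sum(s[i - d] for d in (1, 2, 4, 6) if i - d >= 0))
    return s


def canonical(N, n):
    """Greedy decomposition of N < S_n as a tuple (b_1, ..., b_n) of 0s and 1s."""
    s = samkhyankas(n)
    bits = [0] * n
    for i in range(n - 1, 0, -1):         # try S_{n-1}, S_{n-2}, ..., S_1
        if s[i] <= N:
            bits[i - 1] = 1
            N -= s[i]
    return tuple(bits)


def changed_place(n, N):
    """Return k such that passing from N to N + 1 replaces S_k - 1 by S_k."""
    s = samkhyankas(n)
    before, after = canonical(N, n), canonical(N + 1, n)
    k = after.index(1) + 1                # smallest samkhyanka present in N + 1
    assert before[k - 1] == 0             # the k-th entry was a zero
    assert before[k:] == after[k:]        # entries to the right are unchanged
    assert sum(s[i + 1] for i in range(k - 1) if before[i]) == s[k] - 1
    return k
```

For the two passages discussed above, `changed_place(7, 27)` and `changed_place(7, 15)` return 3 and 4 respectively.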

It is interesting to note that the process is analogous to the rule for going from 
one row to the next in the case of tana prastara, where it is discussed in terms of 
the factorial representation (see Proposition 2, Sec. 2). 

We shall now indicate a method of constructing the binary sequences corre- 
sponding to the rows of the prastara of value n, assuming that such a description 
is possible for the prastara of value n — 1. To do this, we first show how the n- 
druta prastara itself can be obtained through the knowledge of the (n — 1)-druta 
prastara. 

     m | tala-form | S_1  S_2  S_3  S_4  S_5  S_6  S_7  | S_7 - m
       |           | (1)  (2)  (3)  (6)  (10) (19) (33) | (33 - m)
    ---+-----------+------------------------------------+---------
     1 | DP        |  0    0    1    0    1    1    0   |   32
     2 | DLG       |  0    1    0    0    1    1    0   |   31
     3 | LDG       |  1    0    0    0    1    1    0   |   30
     4 | DDDG      |  0    0    0    0    1    1    0   |   29
     5 | DGL       |  0    0    1    1    0    1    0   |   28
     6 | DLLL      |  0    1    0    1    0    1    0   |   27
     7 | LDLL      |  1    0    0    1    0    1    0   |   26
     8 | DDDLL     |  0    0    0    1    0    1    0   |   25
     9 | GDL       |  0    1    1    0    0    1    0   |   24
    10 | LLDL      |  1    0    1    0    0    1    0   |   23
    11 | DDLDL     |  0    0    1    0    0    1    0   |   22
    12 | DLDDL     |  0    1    0    0    0    1    0   |   21
    13 | LDDDL     |  1    0    0    0    0    1    0   |   20
    14 | DDDDDL    |  0    0    0    0    0    1    0   |   19
    15 | PD        |  0    1    0    1    1    0    0   |   18
    16 | LGD       |  1    0    0    1    1    0    0   |   17
    17 | DDGD      |  0    0    0    1    1    0    0   |   16
    18 | GLD       |  0    1    1    0    1    0    0   |   15
    19 | LLLD      |  1    0    1    0    1    0    0   |   14
    20 | DDLLD     |  0    0    1    0    1    0    0   |   13
    21 | DLDLD     |  0    1    0    0    1    0    0   |   12
    22 | LDDLD     |  1    0    0    0    1    0    0   |   11
    23 | DDDDLD    |  0    0    0    0    1    0    0   |   10
    24 | DGDD      |  0    0    1    1    0    0    0   |    9
    25 | DLLDD     |  0    1    0    1    0    0    0   |    8
    26 | LDLDD     |  1    0    0    1    0    0    0   |    7
    27 | DDDLDD    |  0    0    0    1    0    0    0   |    6
    28 | GDDD      |  0    1    1    0    0    0    0   |    5
    29 | LLDDD     |  1    0    1    0    0    0    0   |    4
    30 | DDLDDD    |  0    0    1    0    0    0    0   |    3
    31 | DLDDDD    |  0    1    0    0    0    0    0   |    2
    32 | LDDDDD    |  1    0    0    0    0    0    0   |    1
    33 | DDDDDDD   |  0    0    0    0    0    0    0   |    0


TABLE 17. Canonical decomposition of rows of Seven-druta-prastara


As we have remarked earlier, the prastara of value $n$ is obtained by first
writing down the prastara of value $n - 6$, augmented by P, adding below this the
prastara of value $n - 4$ augmented by G, adding below this the prastara of value
$n - 2$ augmented by L and finally adding below this the prastara of value $n - 1$
augmented by D. We will see how the prastaras of values $n - 2$, $n - 4$ and $n - 6$
sit inside the prastara of value $n - 1$. If we take the $S_{n-2}$ rows of the prastara of
value $n - 1$ ending with D and omit the last D, we obtain the prastara of value
$n - 2$. If we take the $S_{n-4}$ rows of the prastara of value $n - 1$ ending with DL
and omit the last two entries DL from the rows, we obtain the prastara of value
$n - 4$. This is because, if we take the rows of the prastara of value $n - 1$ ending
with L and omit the last entry L, we obtain the prastara of value $n - 3$; now, if we
take the rows of the prastara of value $n - 3$ ending with a D and omit this D, we
obtain the prastara of value $n - 4$. It follows by a similar argument that if we take
the $S_{n-6}$ rows of the prastara of value $n - 1$ ending with DLL and delete these
three entries from the rows, we obtain the prastara of value $n - 6$. From the above
remarks, we deduce the following:


Proposition: The prastara of value n is obtained from the prastara of value
n - 1 as follows:


(i) We take all the rows of the prastara of value n - 1 and augment these by a
D at the end. These form the last S_{n-1} rows of the prastara of value n.


(ii) We take the rows of the prastara of value n - 1 ending with a D and replace
this D by an L. These form the next S_{n-2} rows of the prastara of value n
occurring above the rows constructed in (i).


(iii) We take the rows of the prastara of value n - 1 ending with DL and replace
the last two entries DL of these rows by a G. These form the next S_{n-4} rows
of the prastara of value n occurring above the rows constructed in (ii).


(iv) Finally, we take the rows of the prastara of value n - 1 ending with DLL
and replace the last three entries DLL of these rows by a P. We then obtain
the first S_{n-6} rows of the prastara of value n.
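The four steps of the Proposition can be sketched in a few lines of Python. This is only an illustration: the function name `prastara`, the string encoding of rows, and the convention that the prastara of value 0 consists of one empty row (so that S_0 = 1) are assumptions made here, not part of the text.

```python
# Durations of the four tala elements, in drutas.
VALUE = {"D": 1, "L": 2, "G": 4, "P": 6}

def prastara(n):
    """Rows of the prastara of value n, first row first, built as in the Proposition."""
    if n == 0:
        return [""]  # assumption: one empty row for value 0, so that S_0 = 1
    prev = prastara(n - 1)
    rows_p = [r[:-3] + "P" for r in prev if r.endswith("DLL")]  # step (iv): first rows
    rows_g = [r[:-2] + "G" for r in prev if r.endswith("DL")]   # step (iii)
    rows_l = [r[:-1] + "L" for r in prev if r.endswith("D")]    # step (ii)
    rows_d = [r + "D" for r in prev]                            # step (i): last rows
    return rows_p + rows_g + rows_l + rows_d

def value(row):
    """Total duration of a tala-form in drutas."""
    return sum(VALUE[ch] for ch in row)

sizes = [len(prastara(n)) for n in range(9)]
print(sizes)            # [1, 1, 2, 3, 6, 10, 19, 33, 60]
print(prastara(7)[7])   # DDDLL, the 8th row of the 7-druta prastara
```

The row counts reproduce the samkhyankas 1, 1, 2, 3, 6, 10, 19, 33, 60, and the 8th row of the 7-druta prastara comes out as DDDLL, in agreement with Ex. 1 of Appendix B.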


In a similar vein, we can construct the binary sequences of length n associated with
the prastara of value n, assuming that we have already constructed the binary
sequences of length n - 1 associated with the prastara of value n - 1. To carry
out this construction, we need to write down the canonical decomposition of any
number less than S_n, assuming that we have written down the canonical
decomposition of any number less than S_{n-1}. Now, any integer less than S_n is either
less than S_{n-1} or of the form S_{n-1} + i where 0 ≤ i < S_n - S_{n-1}. Since
2S_{n-1} ≥ S_n, we have i < S_{n-1}. Therefore we can write down the canonical
decomposition of any number less than S_n (assuming that we have written down the
canonical decomposition of any number less than S_{n-1}). This leads to the following
procedure for the construction of the binary sequences of length n that we are looking for.

In order to write down the binary sequences of length n associated with the 
prastara of value n: 


(i) We take all the binary sequences of length n - 1 corresponding to the prastara
of value n - 1 and augment these sequences on the right by 0. These form
the binary sequences corresponding to the last S_{n-1} rows of the prastara of
value n. This amounts to augmenting each of the rows of the prastara of
value n - 1 by D.


(ii) The binary sequences associated with the first S_n - S_{n-1} rows of the prastara
of value n are obtained as follows: We take the binary sequences associated
with the last S_n - S_{n-1} rows of the prastara of value n - 1 and replace
the last 0 of these sequences by the string 10. We can verify that this procedure
amounts to taking the rows of the prastara of value n - 1 ending with a D
and replacing this D by an L, taking the rows of the prastara ending with
DL and replacing the DL by G and finally taking the rows of the prastara
ending with DLL and replacing the DLL with a P.


TABLE 18. Unmeru for Tala Prastara


Remark: It is interesting to see what happens if we apply the procedure
outlined in (ii) to all the binary sequences associated with the prastara of value n - 1.
For example, if n = 8, S_8 - S_7 = 60 - 33 = 27. We have already seen in (ii)
what happens if we take the last 27 rows of the 7-druta prastara and replace the
last 0 of these sequences with the string 10. Now, if we take the binary strings
associated with the first six rows of the 7-druta prastara and replace the last 0 of these
sequences with the string 10, then we see that in each case one of the conditions
(a)-(e) (mentioned in Proposition 3 of Sec. 3) will be violated.


Appendix B. Unmeru method for nasta and uddista


[...] in his Ganita-kaumudi (c. 1356) discusses the nasta and
[...] generalization of the matra-vrtta-prastara. For this pur[pose ...]
[...] refers to as the unmeru. Though Narayana Pandita's
[...] does not subsume the tala-prastara as
[...] and uddista.
[... m]ethod for nasta and uddista by
[...]. To construct the unmeru, in
[...] the samkhyankas up to 33,
[...] column [...] the samkhyankas,
[...]

B.1. Nasta-vidhi


Given any number less than or equal to 33, find the set of patita and apatita
samkhyankas as usual. Starting from the right, note the entry in the first row
above the first apatita (if there is no entry go to the next apatita along the same 
row). In the row above the top most entry of the column of that apatita, move left 
till you reach the column of the next apatita. Note the corresponding entry (or 
move on in the same row till the next apatita, if there is no entry). And so on. 


Ex. 1: To find the 8th tala-form in the 7-druta prastara.


The apatita are 33, 10, 3, 2, 1, 1. We get the tala-form 11122 or DDDLL.


Ex. 2: To find the 15th tala-form in the 7-druta prastara.


33, 19, 3, 1, 1 are apatita. We get the tala-form 61 or PD.


Ex. 3: To find the 28th tala-form in the 7-druta prastara.


33, 19, 10, 6, 1, 1 are apatita. We get the tala-form 4111 or GDDD.

Ex. 4: To find the 2nd tala-form in the 7-druta prastara.


33, 6, 3, 1, 1 are apatita. We get the tala-form 124 or DLG.

Thus, we see that the nasta-process becomes somewhat simpler when we make 
use of the unmeru. Since the uddista-process is quite simple in the standard 
formulation itself, there is no need to discuss the process in terms of the unmeru. 
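The apatita lists quoted in Ex. 1 to Ex. 4 can be reproduced by a short Python sketch. It rests on an assumption that is not spelled out in this appendix: that finding the patita and apatita "as usual" amounts to subtracting the row number from S_n and then greedily subtracting the samkhyankas from the largest downwards. The function names are hypothetical, and the sketch is only checked against the four examples, not against the conditions of the canonical decomposition.

```python
def samkhyankas(n):
    """S_0 .. S_n with S_k = S_{k-1} + S_{k-2} + S_{k-4} + S_{k-6} (missing terms dropped)."""
    s = [1]
    for k in range(1, n + 1):
        s.append(sum(s[k - d] for d in (1, 2, 4, 6) if k - d >= 0))
    return s

def apatita(row, n=7):
    """Apatita samkhyankas for a given row number (assumed greedy subtraction)."""
    s = samkhyankas(n)
    remainder = s[n] - row
    kept = [s[n]]                    # the examples list S_n itself first
    for v in reversed(s[:n]):        # S_{n-1}, S_{n-2}, ..., S_0
        if v <= remainder:
            remainder -= v           # v can be subtracted: patita
        else:
            kept.append(v)           # v cannot be subtracted: apatita
    return kept

for row in (8, 15, 28, 2):
    print(row, apatita(row))
```

For rows 8, 15, 28 and 2 this prints the lists 33, 10, 3, 2, 1, 1; 33, 19, 3, 1, 1; 33, 19, 10, 6, 1, 1; and 33, 6, 3, 1, 1 given in the examples.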


Appendix C. Druta-meru proposed by Akella Mallikarjuna Sarma


The druta-meru described in the Sangitaratnakara and displayed explicitly in
commentaries does not render transparent the recursion relation


D^n_k = D^{n-1}_{k-1} + D^{n-2}_k + D^{n-4}_k + D^{n-6}_k


satisfied by D^n_k (the number of tala-forms with k drutas in the n-druta-prastara).
As is clear by its construction, the successive columns of the druta-meru
correspond to 1-druta, 2-druta etc., prastaras. But, as we noted earlier, the successive
rows have a different interpretation, depending on whether we are considering odd
or even columns. The row above the bottom row alternately gives the number of
tala-forms with 1 druta and 0 druta; the next row alternately gives the number
of tala-forms with 3 drutas and 2 drutas, and so on.
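The recursion for the number of tala-forms of value n with k drutas can be checked numerically. The sketch below reads it as a case split on the last element of the tala-form (D, L, G or P, of values 1, 2, 4, 6); the function name `druta_count` and the base case for value 0 are assumptions made for the illustration.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def druta_count(n, k):
    """Number of tala-forms of value n containing exactly k drutas."""
    if n < 0 or k < 0:
        return 0
    if n == 0:
        return 1 if k == 0 else 0   # assumption: the empty form is the only form of value 0
    return (druta_count(n - 1, k - 1)   # last element D: one druta fewer
            + druta_count(n - 2, k)     # last element L
            + druta_count(n - 4, k)     # last element G
            + druta_count(n - 6, k))    # last element P

# Summing over k recovers the total number of rows of the prastara.
print([sum(druta_count(n, k) for k in range(n + 1)) for n in range(9)])
# [1, 1, 2, 3, 6, 10, 19, 33, 60]
```

The values druta_count(9, 3) = 44, druta_count(11, 3) = 125 and druta_count(13, 3) = 336 agree with entries that survive in the tabulated druta-meru.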


[...] proposed by Akella Mallikarjuna Sarma (see for
[...] critic[al int]erpretation, Hyderabad 2001,
[...]

TABLE 19. Druta-meru of Sangitaratnakara.


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 2, 4, etc., drutas respectively when n is even, and the number of 
tala-forms which have 1, 3, 5, etc., drutas respectively when n is odd. 


[Table body illegible in the source scan. Surviving row fragments: 0, 44, 0, 125, 0, 336; 25, 0, 63, 0, 153, 0; 56, 0, 118.]


TABLE 20. Alternative form of Druta-meru due to Akella 
Mallikarjuna Sarma. 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., drutas respectively.


What’s so Baffling About Negative
Numbers? — a Cross-Cultural 
Comparison 


David Mumford 


I was flabbergasted when I first read Augustus De Morgan's writings about negative
numbers¹. For example, in the Penny Cyclopedia of 1843, to which he contributed
many articles, he wrote in the article Negative and Impossible Quantities: 


It is not our intention to follow the earlier algebraists through their different uses 
of negative numbers. These creations of algebra retained their existence, in the 
face of the obvious deficiency of rational explanation which characterized every 
attempt at their theory. 


In fact, he spent much of his life, first showing how equations with these meaningless 
negative numbers could be reworked so as to assert honest facts involving only 
positive numbers and, later, working slowly towards a definition of abstract rings 
and fields, the ideas which he felt were the only way to build a fully satisfactory 
theory of negative numbers. 

On the other hand, every school child today is taught in fourth and fifth grade 
about negative numbers and how to do arithmetic with them. Somehow, the aversion 
to these ‘irrational creations’ has evaporated. Today they are an indispensable part 
of our education and technology. Is this an example of our civilization advancing 
since 1843, our standing today on the shoulders of giants and incorporating their 
insights? Is it reasonable, for example, that calculus was being developed and the 
foundations of physics being laid — before negative numbers became part of our 
numerical language!? 

The purpose of this article is not to criticize specific mathematicians but first 
to examine from a cross cultural perspective whether this same order of discovery, 
the late incorporation of negatives into the number system, was followed in non- 
Western cultures. Then secondly, I want to look at some of the main figures in 


¹De Morgan's attitudes are, of course, well known to historians of Mathematics. But my naive idea
as a research mathematician had been that at least from the time of Newton and the Enlightenment an
essentially modern idea of real numbers was accepted by all research mathematicians.

Figure 1. Augustus De Morgan


Western mathematics from the late Middle Ages to the Enlightenment and examine
to what extent they engaged with negative numbers. De Morgan was not an isolated
figure but represents only the last in a long line of great mathematicians in the West
who, from a modern perspective, shunned negatives. Thirdly, I want to offer some
explanation of why such an air of mystery continued, at least in some quarters, to
shroud negative numbers until the mid-19th century. There are several surveys of
similar material² but, other than describing well this evolution, these authors seem
to accept it as inevitable. On the contrary, I would like to propose that the late 
acceptance of negative numbers in the West was a strange corollary of two facts 
which were special to the Western context which I will describe in the last section. I 
am basically a Platonist in believing that there is a single book of mathematical truths 
that various cultures discover as time goes on. But rather than viewing the History 
of Mathematics as the unrolling of one God-given linear scroll of mathematical 
results, it seems to me this book of mathematics can be read in many orders. In the 
long process of reading, accidents particular to different cultures can result in gaps, 
areas of math that remain unexplored until well past the time when they would have


²Three references are (i) Jacques Sesiano, The Appearance of Negative Solutions in Medieval
Mathematics, Archive for History of the Exact Sciences, vol. 32, pp. 105-150; (ii) Helena Pycior,
Symbols, Impossible Numbers and Geometric Entanglements, Cambridge Univ. Press, 1997; (iii) Gert
Schubring, Conflicts between Generalization, Rigor and Intuition, Springer 2005.

been first relevant. I would suggest that the story of negative numbers is a prime
example of this effect.³

This paper started from work at a seminar at Brown University but was developed 
extensively at the seminar on the History of Mathematics at the Chennai Mathemat- 
ical Institute whose papers appear in this volume. I want to thank Professors P. P. 
Divakaran, K. Ramasubramanian, C. S. Seshadri, R. Sridharan and M. D. Srinivas 
for valuable conversations and tireless efforts in putting this seminar together. On 
the US side, I especially want to thank Professor Kim Plofker for a great deal of 
help in penetrating the Indian material, Professor Jayant Shah for his help with both 
translations and understanding of the Indian astronomy and Professor Barry Mazur 
for discussions of Cardano and the discovery of complex numbers. I will begin 
with a discussion of the different perspectives from which negative numbers and 
their arithmetic can be understood. Such an analysis is essential if we are to look 
critically at what early authors said about them and did with them. 


1. The Basis of Negative Numbers and Their Arithmetic 


It is hard, after a contemporary education, to go back in time to your childhood 
and realize why negative numbers were a difficult concept to learn. This makes it 
doubly hard to read historical documents and see why very intelligent people in the 
past had such trouble dealing with negative numbers. Here is a short preview to try 
to clarify some of the foundational issues. 

Quantities in nature, things we can measure, come in two varieties: those which, 
by their nature, are always positive and those which can be zero or negative as well 
as positive, which therefore come in two forms, one canceling the other. When one 
reads in mathematical works of the past that the writer discards a negative solution, 
one should bear in mind that this may simply reflect that for the type of variable in 
that specific problem, negatives make no sense and not conclude that that author 
believed all negative numbers were meaningless⁴. Below is a table. The first five are
ingredients of Euclidean mathematics and the sixth occurs in Euclid (the unsigned 
case) and Ptolemy (the signed case, labeled as north and south) respectively. 

What arithmetic operations can you perform on these quantities? If they are 
unsigned, then, as in Euclid, we get the usual four operations: 


1. a + b OK


2. a - b, but only if a > b (as De Morgan insisted so strenuously)


³I believe the discovery of Calculus and, especially, simple harmonic motion, the differential
equations of sine and cosine, in India and the West provide a second example.

⁴For example, Bhaskara II has a problem in which you must solve for the number of monkeys in
some situation, and obviously this cannot be negative.

TABLE I

[Reconstructed from a damaged scan; bracketed entries are illegible, and the row placement of the signed entries for length, time and weight is inferred.]

Modern units        | [...] Quantities                                        | Signed Quantities
positive integer    | # of people/monkeys/[...]                               |
[illegible]         | [illegible] (Euclid, Bk V)                              |
[illegible]         | length of movable rigid [...]                           | (a) distance on line/road, rel. to fixed pt, the 'number line'; (b) also, height above/below the surface of earth
[illegible]         | area of movable rigid flat object                       |
[illegible]         | volume of movable rigid object or incompressible [...]  |
degrees (of angle)  | measure of a plane angle                                | distance N/S of equator
dollars             |                                                         | fortune/debt; profit/loss; [...]
[illegible]         |                                                         | time before or after the present or relative to a fixed event
[illegible]         | your weight on a scale                                  | = force of gravity on your body (a vector)
[... dif]ferent from those of the arguments, [...]; length [...] volume [...]

TABLE II

First summand | Second summand | Sum                                   | Difference
a             | b              | usual a + b                           | a - b if a > b; (neg)(b - a) if b > a
(neg)a        | (neg)b         | (neg)(a + b)                          | b - a if b > a; (neg)(a - b) if a > b
a             | (neg)b         | a - b if a > b; (neg)(b - a) if b > a | [a + b]
(neg)a        | b              | b - a if b > a; (neg)(a - b) if a > b | [(neg)(a + b)]

We write the simple expression a - b, and consider it obviously the same as any of
these:


a + (-b) = a - (+b) = a + (-1)·b


but each is, in fact, a different expression with a different meaning. Given an ordinary
positive number a, -a is naturally defined as the result of subtracting a from 0. For
a minute, to fix ideas, don't write -a, but use the notation (neg)a for 0 - a. Then
note how complicated it is to define a + b for all signs of a and b. Starting with a
and b positive, Table II gives the sums and differences of a and (neg)a with b and
(neg)b.

Understanding this table for the case of addition seems to be the first step in
understanding and formalizing negatives. The second step is to extend subtraction
to negatives so as to get the last column. This is contained in the rule:


a - (-b) = a + b, for all positive numbers a, b.


The basic reason for this is that we want the identity (a - x) + x = a to hold for all x,
positive or negative or, in other words, subtraction should always cancel out addition.
If we take x equal to -b, then replacing a - (-b) by a + b makes this identity
hold. The argument one finds in some historical writings may be paraphrased as 
"taking away a debt of size x is the same as acquiring a new asset of size x", a fact 
obvious to any merchant. In any case, understanding of negatives up to this point 
seems to be a natural stage that one encounters in various historical documents. In 
modern terminology, while acknowledging that our modern words distort historical 
truth, one would paraphrase this stage by saying that it incorporates the idea that 
the integers, positive and negative are an abelian group under addition. 
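The case analysis of Table II can be made concrete with a small Python sketch in which a signed quantity is a pair (is_neg, magnitude) and the only arithmetic used on magnitudes is ordinary addition and the subtraction of a smaller magnitude from a larger one. The pair encoding and the function names are assumptions made for the illustration, not notation from the historical sources.

```python
def add(x, y):
    """Sum of two signed quantities (is_neg, magnitude), by the cases of Table II."""
    (nx, a), (ny, b) = x, y
    if nx == ny:              # like signs: add the magnitudes, keep the common sign
        return (nx, a + b)
    if a >= b:                # unlike signs: subtract the smaller from the larger
        return (nx, a - b)    # the result takes the sign of the larger magnitude
    return (ny, b - a)

def sub(x, y):
    """Difference x - y, using the rule that subtracting (neg)b is adding b."""
    ny, b = y
    return add(x, (not ny, b))

def to_int(s):
    """Translate (is_neg, magnitude) into a modern signed integer, for checking only."""
    return -s[1] if s[0] else s[1]

print(add((False, 3), (True, 5)))   # (True, 2), i.e. 3 + (neg)5 = (neg)2
print(sub((False, 3), (True, 5)))   # (False, 8), i.e. 3 - (neg)5 = 8
```

Comparing `to_int(add(x, y))` with the built-in sum over a range of small magnitudes and both signs confirms that the case rules describe ordinary integer addition and subtraction.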

But multiplication of negatives is a subtler operation, the third and final step in
the arithmetic of negatives. Modern notation again obscures the subtlety. When you
write the simple identity -a = (-1)·a, you are making a big step. Perhaps this is a
contemporary mathematician splitting hairs because historically this seems to have
been assumed as completely natural by nearly every mathematician once they knew
the rules for subtracting negative numbers (with the exception perhaps of Cardano
and Harriot, see below). One difficulty in arguing for this rule is that there are not
many simple cases of quantities in the world where the units of the two multiplicands
allow us to infer the multiplication rule using our physical intuition about the world.
Here are a number of ways of arguing that the identity (-1)·(-1) = +1 must hold.


Method I: Use the basic, intuitively obvious, identity:


distance = velocity × time

and argue that if you substitute:


(a) velocity = movement of one meter backwards per second, a negative number,


(b) time = one second in the past, also negative,


(c) then one second ago, you were 1 meter ahead, i.e. distance = +1 meter.


This 'proves' (-1)·(-1) = +1.


Method I': I know of only one other real world situation where the rule is intuitively
obvious. This variant of the previous argument concerns money and time. We use
the simple equation obvious to any merchant describing the linear growth of a
business's assets:


assets at time t = (rate of change of assets) × (elapsed time t) + (assets at present)


Now suppose a business is losing $10,000 a year and is going bankrupt right now.
How much money did it have a year ago? Substitute t = -1, rate = -10000, present
[assets = 0, and use the] obvious fact that assets a year ago = +10000 to conclude that
[(-10000)·(-1) = +10000].

Method II: (Euclid's geometric algebra)

[... multiplic]ation occurs [...] when the area of a rectangle is the
p[roduct ...]. Consider the diagram below:

The big rectangle has area a·b but the shaded rectangle has area (a - c)·(b - d).
Since the area of the shaded rectangle equals the area of the big rectangle minus the
areas of the top rectangle and the left rectangle plus the area of the small top-left
rectangle (which has been subtracted twice), we get the identity


(a - c) × (b - d) = ab - bc - ad + cd, if a, b, c, d > 0, a > c, b > d


Now we use the idea that identities should always be extended to more general
situations so long as no contradiction arises. If we extend this principle to arbitrary
a, b, c, d (which will bring in negative lengths and areas), we get for a = b = 0:


(-c)(-d) = +cd


This approach is probably the most common way to derive the multiplication rule. It
can be phrased purely algebraically if you extend the distributive law to all numbers
and argue like this (using also 0·x = 0 and 1·x = x):


1 = 1 + (-1)·0 = 1 + (-1)·(1 + (-1)) = 1 + (-1)·1 + (-1)·(-1) = (-1)·(-1).
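The algebraic argument says that the distributive law forces the sign of a product of two negatives. A small Python experiment, offered only as an illustration, makes the point from the other side: if the sign of (negative) × (negative) is left as a free choice, the distributive law survives for the choice +1 and fails for the choice -1. The function names and the parameter `neg_neg_sign` are hypothetical.

```python
from itertools import product

def mul(x, y, neg_neg_sign):
    """Multiply integers, leaving the sign of (negative x negative) as a free choice."""
    magnitude = abs(x) * abs(y)
    if x < 0 and y < 0:
        return neg_neg_sign * magnitude
    if x < 0 or y < 0:
        return -magnitude
    return magnitude

def distributive(neg_neg_sign, values=range(-3, 4)):
    """Check x*(y + z) == x*y + x*z for all triples drawn from values."""
    return all(
        mul(x, y + z, neg_neg_sign)
        == mul(x, y, neg_neg_sign) + mul(x, z, neg_neg_sign)
        for x, y, z in product(values, repeat=3)
    )

print(distributive(+1))   # True
print(distributive(-1))   # False: x = -1, y = 1, z = -1 gives 0 on the left, -2 on the right
```

The failing triple is exactly the one used in the chain of equalities: (-1)·(1 + (-1)) must be 0, which is only possible if (-1)·(-1) cancels (-1)·1.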

Method III: Start with the multiplication

(positive integer n) × (any quantity a) = (more of this quantity na)


(e.g. 4 × (quart of milk) = a gallon of milk), then by subdividing quantities as well
as replicating them, you can define multiplication


(positive rational) × (quantity a)

and by continuity (as in Eudoxus), define

(positive real) × (quantity a)


What we are doing is interpreting multiplication of any quantity by a positive
dimensionless real number as scaling it, making it bigger or smaller as the case may
be. Now if the quantity involved is signed you find it very natural to interpret
reversing its sign as scaling by -1, i.e. to make the further definition:


(-1) × (quantity a) = (quantity -a)


Now you have multiplication by any real number, positive or negative. In other
words, the negative version of scaling is taking quantities to their opposites.
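The scaling point of view can be sketched in Python by treating "scale by a positive factor" and "take the opposite" as two separate maps on signed quantities and defining scaling by a negative factor as their composite. The names `scale`, `negate`, `scale_neg` and `compose` are illustrative assumptions, and rational numbers stand in for real ones.

```python
from fractions import Fraction

def scale(r):
    """Scaling by a positive rational factor r."""
    assert r > 0
    return lambda q: r * q

def negate(q):
    """Take a signed quantity to its opposite, i.e. 0 - q."""
    return 0 - q

def scale_neg(r):
    """Scaling by -r, defined as 'scale by r, then take the opposite'."""
    return lambda q: negate(scale(r)(q))

def compose(f, g):
    """The map 'first g, then f'."""
    return lambda q: f(g(q))

samples = [Fraction(-7, 2), Fraction(0), Fraction(5)]
twice_flipped = compose(scale_neg(Fraction(1)), scale_neg(Fraction(1)))
print([twice_flipped(q) == q for q in samples])   # [True, True, True]
```

Composing scaling by -1 with itself gives back every sample quantity unchanged, which is the statement (-1)·(-1) = +1 read as a statement about maps rather than numbers.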

The core of this argument is the algebraic fact that the endomorphisms of an 
abelian group form a ring and we are constructing multiplication out of addition as 
composition of endomorphisms. This makes the third approach arguably the most 
natural to a contemporary mathematician trained in the Bourbaki style.


2. Negatives in Chinese and Indian Mathematics


We will discuss China first. The classic of Chinese mathematics is the Jiuzhang 
Suanshu (Nine Chapters on the Mathematical Art). Like Euclid, this is a com- 
pendium of the mathematical concepts and techniques which had been developed 
slowly from perhaps the Zhou (or Chou) dynasty (begins c.1000 BCE) through the 
Western Han dynasty (ending 9 CE). Unlike Euclid, it is a list of practical real world 
problems and algorithms for their solution, without any indication of proofs. Since 
then, the Nine Chapters had a long history of ups and downs, sometimes being 
required in civil service exams and sometimes being burned and nearly lost. Each 
time it was republished though, new commentaries were added, starting with those 
of the great mathematician Liu Hui in 263 CE and continuing through those in the 
English translation by Shen, Crossley and Lun⁵. Page numbers in our quotes are
from this last edition. 

Starting some time in the first millennium BCE, arithmetic in China began to 
be carried out using counting rods, which were arranged in rows using a decimal 
place notation. When doing calculations, different numbers were laid out by rods 
in a series of rows, forming a grid: a Japanese illustration of how they were used is 
shown in the figure below. 


Figure 2. A Japanese illustration of calculation with counting rods


The section of the Nine Chapters in which negative numbers are introduced 
and used extensively is Chapter 8, Rectangular Arrays. This Chapter deals with 
the solutions of systems of linear equations and expounds what is, to all intents 
and purposes, the method of Gaussian Elimination. In fact, it is indistinguishable 
from the modern form. The coefficients are written out in a rectangular array of 
rod numerals and one adds and subtracts multiples of one equation from another 
equation until the system has triangular form. Examples as large as five equations in 


⁵The Nine Chapters on the Mathematical Art: Companion and Commentary, Shen Kangshen, John
N. Crossley, and Anthony W.-C. Lun, Oxford University Press, 1999.

five unknowns are worked. Naturally negative numbers appear all the time in such
an algorithm. 

As described in Liu's commentary, red rods or upright rods were used for positive 
numbers which he calls gains (zheng) and black rods or slanting rods for negative 
numbers which he calls losses (fu). He says “red and black counting rods are used 
to cancel each other". Curiously, his colors are the exact opposite of our Western 
accounting convention! Here is Problem 8 from this Chapter, p.409 in the Shen, 
Crossley and Lun edition: 


Now sell 2 cows and 5 sheep, to buy 13 pigs. Surplus: 1000 cash. Sell 3 cows and 3 pigs to 
buy 9 sheep. There is exactly enough cash. Sell 6 sheep and 8 pigs. Then buy 5 cows. There is 
600 coins deficit. Tell: what is the price of a cow, a sheep and a pig respectively? 


This means the three equations (all of which have negative coefficients as well as 
positive): 


2C + 5S - 13P = 1000
3C - 9S + 3P = 0
-5C + 6S + 8P = -600


The solution is found to be C = 1200, S = 500, P = 300. The Nine Chapters goes 
on rather mysteriously (p.404): 


Method: Using rectangular arrays lay down counting rods for each entry to be added. 
The Sign Rule 


Like signs subtract; opposite signs add; positive without extra, make negative; negative 
without extra, make positive. 
Opposite signs subtract; same signs add; positive without extra, make positive; negative 
without extra, make negative. 


Liu's commentary explains: the first set of sign rules refers to subtraction of array 
entries, the second to addition. He goes on to clarify the meaning of the cryptic 
Sign Rule. In fact, the rule is precisely what we wrote out in Table II above for both 
addition and subtraction. What is clear is that negative numbers were analyzed and 
treated correctly as soon as the need arose, presumably for the first time anywhere 
in the world. 
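The system of Problem 8 (2 cows and 5 sheep sold for 13 pigs with 1000 surplus, and so on) can be run through a modern elimination routine to see how freely negative entries appear along the way. The sketch below is a generic Gauss-Jordan elimination over exact fractions, not a reconstruction of the counting-rod procedure of the Nine Chapters; the function name `solve` and the row layout are assumptions made for the illustration.

```python
from fractions import Fraction

def solve(rows):
    """Gauss-Jordan elimination on an augmented matrix, using exact fractions."""
    m = [[Fraction(v) for v in row] for row in rows]
    n = len(m)
    for col in range(n):
        # Find a row with a nonzero entry in this column and move it into place.
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        # Scale the pivot row so that the pivot entry becomes 1.
        m[col] = [v / m[col][col] for v in m[col]]
        # Subtract multiples of the pivot row from every other row.
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[col])]
    return [row[-1] for row in m]

# Columns: cows, sheep, pigs, right-hand side.
problem_8 = [
    [2, 5, -13, 1000],
    [3, -9, 3, 0],
    [-5, 6, 8, -600],
]
cow, sheep, pig = solve(problem_8)
print(cow, sheep, pig)   # 1200 500 300
```

The result agrees with the prices C = 1200, S = 500, P = 300 stated in the text.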

I cannot find in the Shen et al. edition of the Nine Chapters any treatment of
multiplication of negative numbers, although Martzloff⁶ quotes the Chinese edition
of Qian Baocong as saying: "Rods of the same name multiplied by each other make
positive. Rods of different names multiplied by each other make negative". In any


⁶Jean-Claude Martzloff, A History of Chinese Mathematics, 2nd edition, Springer, 1997, page 203.

case, Liu's commentary, written in the 3rd century CE, makes the remark (p.405):
"Interchanging the red and black rods in any column is immaterial. So one can
make the first entries of opposite sign." This is the correct rule for multiplication
by -1.

Chinese algebra had a renaissance in the Song and Yuan (Mongol) dynasties. In
particular, Zhu Shijie (c.1260-c.1320) extended the ideas of Gaussian elimination
to the simultaneous solution of polynomial equations, inventing the equivalent of
the resultant and using ever larger and more complex arrays of coefficients. At
this stage, as one would expect, the full rules for negative arithmetic emerge quite
explicitly as well as those for the algebra of polynomials. Having a theory of negatives
is the clear prerequisite for going further in the study of algebra. Zhu's algebra
reached a stage not attained in Europe until the late 19th century.

I want to turn to India next. In every culture, one of the main reasons for the
development of arithmetic (arguably the principal driving force) is the need
of merchants to keep accounts. In fact, it is even hypothesized that arithmetic and
writing itself emerged in the 3rd millennium BCE in Mesopotamia as a development
of a crude system of tracking transactions of agricultural goods by means of small
specially shaped and inscribed tokens⁷. By around the year 2000 BCE, one finds
tablets from Ur⁸ with a yearly summary accounting, showing budgeted and actual
inputs (with value converted into a common unit of barley), budgeted and actual
outputs, budgeted and actual labor and differences, shortfalls or profits! In India, very
sophisticated principles of accounting were codified in Kautilya's comprehensive
manual of statecraft, the Arthasastra, written in the 4th century BCE. The Arthasastra
covers in amazing detail every aspect of setting up and managing of a kingdom
(including managing a special forest for elephants). In Book II, Chapter 6 and also
in many later Chapters of Book II, Kautilya details how accounts are to be kept⁹.
He describes a complete system of bookkeeping: he has a ledger for income with
dates, times, payers, categories, etc. and a ledger for expenditures and finally a
third ledger for balances. There are sections on auditing, insurance against theft,
debtors, borrowings, mortgages, etc. and subtler accounting issues such as
current vs. deferred receipts, how to account for price changes of items in inventory,
fixed vs. variable costs. Although he does not use negative numbers explicitly, he is


⁷The pioneer here has been Denise Schmandt-Besserat, who has brought her life's work together in
the multi-volume book Before Writing, volume I being From Counting to Cuneiform, University of Texas
Press, 1992. In particular, she has "deciphered" the mysterious tokens found throughout the Middle East
from roughly 8000 BCE to 3000 BCE, finding a simple method of accounting which merges seamlessly
with highly developed cuneiform accounts in the 3rd millennium.

⁸See Chapter 5 in Richard Mattessich, The Beginnings of Accounting and Accounting Thought:
Accounting Practice in the Middle East (8000 B.C. to 2000 B.C.) and Accounting Thought in India (300
BCE to the Middle Ages), Garland Publishing, 2000.

⁹See Chapter 6 in Mattessich, Op. Cit., which is based on the book Modern Accounting Concepts in
Kautilya's Arthasastra by Anjan Bhattacharyya, Firma KLM, Calcutta, 1988.

clearly aware of how accounts must sometimes show a deficit and that people may
carry a negative net worth.

Although the Arthasastra does not mention negative numbers explicitly, they
appear full blown in Brahmagupta's treatise Brahma-sphuta-siddhanta (628 CE).
The development of mathematics in India in the first millennium CE is connected 
much more strongly to astronomy than to accounting. For much of this period, 
treatises covering both mathematics (ganita) and astronomy (the motion of the sun, 
moon and planets and their positions at a given time and place in the sky) and called 
siddhantas were composed. Many of these were in verse, highly compressed and 
cryptic, meant to be memorized and handed down generation by generation from 
teacher to student. 

The Brahma-sphuta-siddhanta includes two Chapters on mathematics which are
a compendium of the mathematical concepts and techniques which had been
developed over previous centuries. Here we find all the correct rules for arithmetic with
negative numbers and in it positive numbers are referred to as "fortunes", negative
numbers as "debts". It appears that accounting led naturally to an arithmetic in
which negative numbers took their natural place. Here are some quotations,
showing first the rules we laid out in Table II and then, significantly, going on to describe
how to multiply negative numbers¹⁰:


[The sum] of two positives is positive, of two negatives, negative; of a positive and a negative 
[the sum] is their difference; if they are equal, it is zero. The sum of a negative and zero is 
negative, of a positive and zero positive, of two zeros, zero. 


[If] a smaller [positive] is to be subtracted from a larger positive, [the result] is positive; [if] 
a smaller negative from a larger negative, [the result] is negative; [if] a larger from a smaller, 
their difference is reversed — negative becomes positive and positive negative. 


The product of a negative and a positive is negative, of two negatives positive, and of positives 
positive; the product of zero and a negative, of zero and a positive, or of two zeros is zero. 


A positive divided by a positive and negative divided by a negative is positive; a zero divided 
by a zero is zero; a positive divided by a negative is negative; a negative divided by a positive 
is negative. 


Chapter 18, verses 30-34 


The only oddity seems to be his confident assertion that 0/0 = 0. The rest is 
as clear and modern as one could wish for. It would be wonderful to know what 
considerations led Indian mathematicians in the late centuries BCE or the early 
centuries CE to these conclusions — especially for the multiplication of nega- 
tive numbers. The predominately oral transmission of knowledge in the Vedic 


¹⁰ We quote from the translation by Kim Plofker in her book, Mathematics in India, 500 BCE – 1800
CE, Chapter 5, p.151.
tradition, and perhaps the difficulty of preserving perishable writing materials
through yearly monsoons, has not left us with any record of these discoveries.
They just appear full blown in Brahmagupta's summary. R. Mattessich has developed
at length the idea that it was the highly developed tradition of accounting which
led to the full understanding of negative numbers¹¹ but unfortunately no evidence
for this plausible conjecture exists.

As in China, having negative numbers opened the way to deeper studies of
algebra itself. Perhaps the deepest of these was the Indian work on Pell’s equation
$x^2 - Ny^2 = m$, especially finding solutions for $m = 1$. Brahmagupta himself made
the first huge step, discovering the multiplication law arising from the factorization


$$x^2 - Ny^2 = (x + \sqrt{N}\,y)(x - \sqrt{N}\,y).$$


More exactly, he showed how from solutions of the equation for $m_1$, $m_2$, one gets
one for their product $m = m_1 \cdot m_2$. Some centuries later, Jayadeva found a complete
algorithm for constructing solutions with $m = 1$.
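
In modern notation the multiplication law can be sketched in a few lines of Python. This is only an illustration: the function names `compose` and `norm` and the sample values with N = 2 are our own choices, not anything found in Brahmagupta's text. Multiplying out the two factorizations gives $(x_1 x_2 + N y_1 y_2)^2 - N (x_1 y_2 + x_2 y_1)^2 = m_1 m_2$, which is what the code checks.

```python
def norm(sol, N):
    """Return x^2 - N*y^2 for a pair sol = (x, y)."""
    x, y = sol
    return x * x - N * y * y


def compose(sol1, sol2, N):
    """Combine a solution for m1 with a solution for m2 into one for m1*m2.

    Modern restatement of the composition law: multiply
    (x1 + sqrt(N)*y1) by (x2 + sqrt(N)*y2) and collect terms.
    """
    x1, y1 = sol1
    x2, y2 = sol2
    return (x1 * x2 + N * y1 * y2, x1 * y2 + x2 * y1)


N = 2                       # illustrative value, chosen here
a = (3, 2)                  # 3^2 - 2*2^2 = 1
b = (2, 1)                  # 2^2 - 2*1^2 = 2
c = compose(a, b, N)        # should solve x^2 - 2*y^2 = 1*2
d = compose(a, a, N)        # composing a solution for m = 1 with itself
print(c, norm(c, N))
print(d, norm(d, N))
```

Composing a solution for m = 1 with itself again gives m = 1, so a single such solution generates infinitely many.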

We find reflections of the Indian use of negatives in their astronomy too. As 
stated, the main goal of these scholars was not to develop mathematics for its own 
sake but to apply mathematics to predict the positions of the sun, moon and planets. 
An epicyclic theory is used and, for the planets, both a ‘slow’ and ‘fast’ correction 
is added to the mean motion of the planet (in our terms, one is due to the ellipticity 
of their orbit, the other to the shift from a heliocentric to a geocentric description). 
David Pingree¹² has hypothesized that through the intermediary of the Indo-Greek
empire, some version of the pre-Ptolemaic Hipparchan theory of planetary motion 
reached India. What is quite striking is that in making these corrections the sine 
function in all four quadrants is understood. Hipparchus had computed tables of 
chords, which are fundamentally unsigned positive quantities. The Indian tradition
shifts to sines (actually ‘Rsines’, sines multiplied by a large radius and rounded to the
nearest integer)¹³ and then it is natural to extend them from the first quadrant to the
[...] Here is a quote from the Bráhma-sphuta-siddhánta, Chapter 2, verse 16:


[...] made by adding or subtracting appropriate sine function [...] the slow case)
negative, positive, positive, negative correction, [...]


[...] of two positives (is) positive, of two negatives (is) [...] (is their) difference,
of equals (positive and negative is) [...]
It would be nice if they had drawn a graph of the correction in all quadrants, i.e. of
the sine function, to clarify this verse, but that was clearly not their modus operandi.
But further evidence that the sine function was seen as being extended to more than
one quadrant comes from the rational approximation of the sine in the first two
quadrants given by Bhaskara I (7th century CE)¹⁴:

$$\sin\theta \approx \frac{16\,\theta(\pi - \theta)}{5\pi^2 - 4\,\theta(\pi - \theta)}, \qquad 0 \le \theta \le \pi.$$

This is an extraordinarily accurate approximation which would be hard to come up 
with if they had not grouped the first and second quadrant together. 
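
The accuracy claim is easy to test numerically. The sketch below assumes the standard modern form of the approximation, $\sin\theta \approx 16\theta(\pi-\theta)/(5\pi^2 - 4\theta(\pi-\theta))$ with $\theta$ in radians; the function name `bhaskara_sin` and the grid of 1001 sample points are our own choices.

```python
import math


def bhaskara_sin(theta):
    """Rational approximation to sin(theta) for 0 <= theta <= pi (radians)."""
    p = theta * (math.pi - theta)
    return 16 * p / (5 * math.pi ** 2 - 4 * p)


# Largest absolute error over an evenly spaced grid covering both quadrants.
grid = [k * math.pi / 1000 for k in range(1001)]
max_err = max(abs(bhaskara_sin(t) - math.sin(t)) for t in grid)
print(f"maximum absolute error on the grid: {max_err:.5f}")
```

The formula is exact at 0, π/6, π/2, 5π/6 and π, and because it depends on θ only through θ(π − θ) it is automatically symmetric about π/2, which is the grouping of the two quadrants referred to above.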

Another natural place for using negative numbers is for coordinates, e.g. to 
measure the celestial latitude (perpendicular to the ecliptic), or the declination (per- 
pendicular to the celestial equator), of a planet or star. Tradition, however, sanctifies 
describing latitudes and declinations as north/south instead of positive/negative and 
this is hard to change. But this latitude must often be put into formulas when convert- 
ing from celestial coordinates to horizon based coordinates, e.g. when calculating 
the very important rising times of planets. At this point, rules for negatives again must 
be used. Here is an example from Brahmagupta's Khandakhádyaka, Ch.6, verse 5¹⁵:


Multiply the north celestial latitude by the equinoctial shadow and divide by 12; apply the 
quotient taken as minutes negatively or positively to (the longitude measured from) the orient 
and occident ecliptic points. When the celestial latitude is south, apply the resulting minutes 
to the same points positively or negatively. 


In modern terms (see Figure 3), he is computing (longitude KA) + (latitude KV)
$\times \tan(\phi)$, where $\phi$ is the observer's latitude, and distinguishing the cases where
longitude is measured eastwards or westwards and where the planet's latitude is
north or south.
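
With signed numbers the four cases of the verse collapse into one expression. The following sketch is a modern paraphrase under stated assumptions, not a reconstruction of Brahmagupta's procedure: we take north latitude as positive and south as negative, measure the latitude in arcminutes, and read "equinoctial shadow divided by 12" as $\tan(\phi)$, as in the modern reading above. The function name and the sample numbers are hypothetical.

```python
def rising_correction(latitude_arcmin, equinoctial_shadow, at_orient=True):
    """Signed correction, in arcminutes, to the orient or occident ecliptic point.

    latitude_arcmin    -- celestial latitude; north > 0, south < 0 (our convention)
    equinoctial_shadow -- shadow of a 12-unit gnomon, so shadow / 12 = tan(phi)
    at_orient          -- True for the orient (rising) point, False for the occident
    """
    quotient = latitude_arcmin * equinoctial_shadow / 12
    # The verse applies the quotient negatively at the orient point and
    # positively at the occident point when the latitude is north; the signed
    # latitude reverses both automatically when the latitude is south.
    return -quotient if at_orient else quotient


# Hypothetical values: 60' of latitude, shadow of 6 units (tan(phi) = 0.5).
print(rising_correction(60, 6, at_orient=True))    # north latitude, orient point
print(rising_correction(-60, 6, at_orient=True))   # south latitude, orient point
```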

An explicit interpretation of negatives as coordinates on a number line occurs
later in the work of the 12th century Bhaskara II (so-called to distinguish him from
the earlier 7th century Bhaskara I). He wrote an immensely popular textbook on
Algebra, the Lílávatí¹⁶. The title was said by a Persian translator to be the name of
Bhaskara's daughter and, although this is not made explicit in the book, it is full of
verses addressed to the "beautiful one", "the fawn-eyed one", etc. Present-day texts
are so drab in comparison!

The remarkable passage is in verse 166 and again it is given without any fanfare 
stating that a new interpretation of negative numbers is being given. But, to my 


¹⁴ Bhaskara I, Mahabhaskariya, Ch. 7, verses 17-19.
¹⁵ The Khandakhádyaka of Brahmagupta, with the commentary of Bhattotpala, edited and translated
by Bina Chatterjee, World Press, Calcutta, 1970, p.122-3.
¹⁶ We follow the classic translation by H. T. Colebrooke, first published in 1817 and subsequently
reprinted in numerous editions.
Figure 3. Diagram for calculation of rising time. Planet at V,
KA ecliptic, P north celestial pole.


knowledge, it is the first occurrence of the "number line", of using positive and
negative numbers as coordinates on either side of an origin. Bhaskara is in the
middle of a discussion of triangles and, specifically, given the three sides a, b, c of
the triangle with a distinguished side c, the base, how to find the altitude and the
position of the foot of the perpendicular dropped on the base. If you let x be the
distance from one endpoint of the base to the foot, then (c − x) is the distance from
the other endpoint to the foot and Pythagoras's theorem tells us:


$$a^2 - x^2 = \text{altitude}^2 = b^2 - (c - x)^2$$


[...] (see Figure 4). Aha: what to do? [...] foot of the perpendicular lies outside [...]
Figure 4. A triangle with a perpendicular falling outside the
base, Bhaskara II


time. Nonetheless, as we will see, it doesn't occur in Europe before the work of 
Wallis near the end of the 17th century.
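
To see how a negative coordinate falls out of the triangle computation, note that the $x^2$ terms cancel in $a^2 - x^2 = b^2 - (c - x)^2$, leaving $x = (a^2 - b^2 + c^2)/(2c)$. The sketch below simply evaluates this; the function name and the sample triangle (sides 10 and 17 on a base of 9) are our own illustrative choices.

```python
from fractions import Fraction


def foot_and_altitude(a, b, c):
    """Locate the foot of the perpendicular dropped on the base c.

    Returns (x, c - x, altitude squared), where x is the signed distance
    from the endpoint adjacent to side a to the foot of the perpendicular.
    """
    x = Fraction(a * a - b * b + c * c, 2 * c)   # from a^2 - x^2 = b^2 - (c - x)^2
    altitude_sq = a * a - x * x
    return x, c - x, altitude_sq


x, rest, altitude_sq = foot_and_altitude(10, 17, 9)
print(x, rest, altitude_sq)
```

Here x comes out negative: the foot lies 6 units beyond the endpoint, on the far side from the base, while the other segment is 15 and the altitude is 8. Read as coordinates on a line, the two segments still add up to the base, −6 + 15 = 9.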


3. The Shunning of Negative Numbers, From Al-Khwarizmi to Galileo


I now turn to the Arab and Western treatment of negative numbers. To keep the 
story within bounds, I will pick a small selection from the many figures who might 
be discussed, those who seem to me key figures in the story or who exemplify a 
particular stand. 


Al-Khwarizmi (c.790—c.840) 


It is repeated everywhere that the Indians invented zero and place notation and that 
the Arabs learned it from them and later transmitted this to Europe. It's bizarre 
that such a misunderstanding should be widespread but in fact, the Babylonians 
invented place notation (albeit using base 60) and their arithmetic was used by 
many Greeks, e.g. Ptolemy. I hope I have made the case that the most substantial 
arithmetic discovery of the Indians — and independently the Chinese — was not 
merely that of zero but the discovery of negative numbers. Sadly this discovery was 
not absorbed in any but a superficial way by the Arabs. 

Al-Khwarizmi (whose full name was Abu Ja'far Mohammad ibn Musa Al- 
Khwarizmi) was familiar with Indian mathematics and astronomy and apparently 
with Brahmagupta's Bráhma-sphuta-siddhánta written some 200 years earlier. He 
worked under the patronage of the caliph Al-Mamun about whom he says "That
fondness for science, ..., that affability and condescension which he (the caliph)
shows the learned ... has encouraged me to compose a short work on calculating by
Completion and Reduction ... such as men constantly require in cases of inheritance,
legacies, partition, law-suits and trade ..."¹⁷ His book on Algebra is entitled
Al-jabr w'al muqabala which refers to the operations of completion and reduction with


¹⁷ The Algebra of Mohammed ben Musa, translated by Frederic Rosen. Facsimile reprint of 1831
edition by the Oriental Translation Fund, London, Adamant Media Corporation 2002.
which he simplified his equations. These were relations between an unknown, its
square and constants, given in prose. Nearly half of his book concerns incredibly
complex inheritance cases.

I find three things especially striking in this book. Firstly, negative numbers
appear only once, in a section on multiplication whose goal appears to be to explain
the identity


$$(a - c) \cdot (b - d) = ab - ad - bc + cd$$


and justify it by geometry, just as in our discussion of "Method II" for multiplying
negative numbers. But then they are never mentioned again. The second striking
thing is that quadratic equations always have positive coefficients and thus belong
to three types:


1. $ax^2 + bx = c$, $a, b, c > 0$ (referred to as "roots and squares are equal to
numbers")

2. $ax^2 + c = bx$, $a, b, c > 0$ ("squares and numbers are equal to roots")

3. $ax^2 = bx + c$, $a, b, c > 0$ ("roots and numbers are equal to squares")


This separation of cases continues down through the whole European tradition 
through De Morgan. An equation, in short, must be an identity between two positive 
numbers. Thirdly, he discusses exactly the same problem that Bhaskara II was to 
take up: finding altitudes of triangles whose sides are given. But, unlike Bhaskara, 
all the examples he treats have the foot of the perpendicular inside the base so this 
big clue about negatives never comes up. 
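
The three types listed above are what a single modern equation $Ax^2 + Bx + C = 0$ splits into once every term must be written with a positive coefficient. The sketch below makes the bookkeeping explicit; the function name, the normalization $A > 0$ and the sample coefficients are our own assumptions for illustration.

```python
def khwarizmi_type(A, B, C):
    """Classify A*x^2 + B*x + C = 0 (A > 0, B and C nonzero) by moving each
    negative term to the other side so that all coefficients are positive."""
    if A <= 0 or B == 0 or C == 0:
        raise ValueError("expects A > 0 and nonzero B, C")
    if B > 0 and C < 0:
        return "roots and squares are equal to numbers"    # A x^2 + B x = |C|
    if B < 0 and C > 0:
        return "squares and numbers are equal to roots"    # A x^2 + C = |B| x
    if B < 0 and C < 0:
        return "roots and numbers are equal to squares"    # A x^2 = |B| x + |C|
    return None  # B > 0 and C > 0: a sum of positives equal to zero, not admitted


print(khwarizmi_type(1, 10, -39))
print(khwarizmi_type(1, -10, 21))
print(khwarizmi_type(1, -3, -4))
print(khwarizmi_type(1, 3, 2))
```

The fourth sign pattern has no positive root at all, which is why only three types appear in the list.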


Leonardo of Pisa (1170-1250)


[...] and place notation. He wrote a remarkable book, [...], in which the rules for all
the basic arithmetic [...] in great [...] and exhaustively illustrated by numerical [...]
of his book which is essentially what we [...]
involving a group of people who, after exchanging various sums of money, have
sums satisfying some linear relationships. Here is an example¹⁸:


Three men had pounds of sterling, I know not how many, of which one half was the first's, one
third was the second's and one sixth was the third's; as they wished to have it in a place of
security, every one of them took from the sterling some amount, and of the amount that the
first took he put in common one half, and of it that the second took, he put in common a third
part, and of that which the third took, he put in common a sixth part, and from that which they
put in common every one received a third part, and thus each had his portion.


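In modern algebra terms, if $S$ is the sum of sterling and $x_1, x_2, x_3$ are the sums which
the three men took, so that $(x_1/2 + x_2/3 + x_3/6)$ is what "they put in common",
then the last sentence "each had his portion", sets up three equations:

$$\frac{x_1}{2} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{2}$$

$$\frac{2x_2}{3} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{3}$$

$$\frac{5x_3}{6} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{6}$$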


This is only one of hundreds of such problems. He develops methods of laying 
out the coefficients in rows and manipulating the numbers to get the answer. In the 
above, the ‘answer’, is the smallest set of relatively prime x’s which solve these three 
homogeneous equations in 3 unknowns. Leonardo has a rather awkward and special 
version of the Chinese algorithm for solving linear equations in many unknowns. 

Now most of his problems are set up so all the numbers which occur are positive. 
But not all! First of all, negative numbers can arise in the course of the calculation. 
He then says things like¹⁹:


[he is in the middle of an algorithm]. . . and from the 240 you subtract 288 leaving minus 48, 
and this I say because the 288 cannot be subtracted from the 240; from this 48 you take 1/3 
for the 1/3 of the second position; there will be minus 16. . . . 


He is getting close to the red and black rods of the Chinese, but these examples are 
few and far between and are not pursued very far. In a few other cases, the answer 
itself is negative. For example, after solving the problem described in the first quote, 
he varies the proportions of S owned by the three men to 1/2, 2/5 and 1/10. In this 
case, the solution is $x_1 = 326$, $x_2 = 174$ and $x_3 = -30$. The setting of the problem,
that all the x's are amounts of money, comes to his rescue. The third man, he says, 


¹⁸ Leonardo of Pisa, Liber Abaci, p.415 of the English translation by L. Sigler, Fibonacci's Liber
Abaci, Springer-Verlag, 2002.
¹⁹ Ibid., p.419.
does not take anything from the sum S which they share but instead puts in an
additional 30 pounds of his own "proper" money: there were 470 pounds in all, and
when they "wanted to have it in a place of security", the third man added 30, the
first man took 326 and the second took 174. When money is concerned, negative
quantities can always be given a simple meaning!

Leonardo is making the first tentative steps towards enlarging the number system
to include negatives. With money, he is comfortable with assets and debts, giving
and taking. But his examples are few and he never makes explicit rules for extending
arithmetic.
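
Both versions of the three-men problem can be checked with exact fractions. The sketch below is a modern shortcut, not Leonardo's tabular method: each man keeps the part of his take that he did not pool, plus a third of the pool, and this must equal his share of the total S. Adding the three conditions shows that the takes sum to S automatically, so S can be chosen freely and only sets the scale. The function name and parameter names are ours.

```python
from fractions import Fraction as F


def amounts_taken(shares, total, pooled=(F(1, 2), F(1, 3), F(1, 6))):
    """Solve (1 - f_i) * x_i + C / n = p_i * total, where C = sum(f_j * x_j).

    shares -- the fractions p_i of the total owned by each man
    pooled -- the fraction f_i of his take that each man puts in common
    """
    n = len(shares)
    # Substituting x_i = (p_i * total - C / n) / (1 - f_i) into the definition
    # of C leaves one linear equation, C * B = total * A.
    A = sum(f * p / (1 - f) for f, p in zip(pooled, shares))
    B = 1 + sum(f / (1 - f) for f in pooled) / n
    common = total * A / B
    return [(p * total - common / n) / (1 - f) for f, p in zip(pooled, shares)]


first = amounts_taken((F(1, 2), F(1, 3), F(1, 6)), 47)
varied = amounts_taken((F(1, 2), F(2, 5), F(1, 10)), 470)
print([int(x) for x in first])
print([int(x) for x in varied])
```

With the original shares and S = 47 every take is positive; with the varied shares and S = 470 the third take is the negative amount quoted above.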


Nicole Oresme (1323-1382) 


Nicole Oresme was a mathematically inspired scholastic, working in Paris in the
mid-14th century. He made a giant stride taking geometry beyond Euclid. In his
great book, Tractatus de configurationibus qualitatum et motuum (Treatise on the
configurations of qualities and motions)²⁰, he proposed considering all intensities
which varied in time and whose values at different times could be compared by a
proportion. For any such quality, he proposed constructing a graph. First he took a
line segment, called the subject, whose points represented the interval of time over
which the quality was varying. This, in itself, was a radical departure from Euclid:
now space was being used analogically, as a substitute for time. Then he proposes
erecting line segments perpendicular to the subject whose lengths had the same
proportions as the qualities being graphed:


Therefore, every intensity which can be acquired successively ought to be imagined by a 
straight line perpendicularly erected on some point of the space or subject of the intensible 
thing, e.g. a quality. For whatever ratio is found to exist between intensity and intensity of the 
same kind, a similar ratio is found to exist between line and line, and vice versa. . .. Therefore, 
the measure of intensities can be fittingly imagined as the measure of lines. (Oresme, I.i)


He talks about graphing many things (although he never gathers data or actually 
goes beyond making simple cartoons of his graphs — see Figure 5). In particular, 
he discusses graphing velocity, temperature, pain and grace (of a soul). Some of 
these are clearly positive quantities by nature, e.g. pain and grace. He is interested in 
contrasting intensities which are constant (graph (a) in figure), intensities which vary 
at a constant rate (graph (b) in the figure) and intensities which are more complex 
(graphs (c) in figure). For example the grace of a soul ‘occupied by many thoughts 
and affected by many passions’ will be difformly difform — his name for type (c). 
On the other hand, velocities can clearly change sign and, as for temperature, he 
even considers there to be complementary intensities of hotness and coldness. For 


²⁰ Translations are from Marshall Clagett's translation, Nicole Oresme and the Medieval Geometry
of Qualities and Motions, University of Wisconsin, 1968.


Figure 5. Oresme’s examples of graphs 


temperature, hotness might have a graph with values f(x) and coldness a graph 
with values C — f(x). In other words, he adds a suitable positive constant so as to 
make every intensity positive everywhere. 

Because his graph is the whole area, not simply the curve at the tips of his line
segments, he cannot have a graph which goes from positive to negative, crossing 
the ‘subject’. This is especially striking because at one point he makes a catalog 
of various types of difformly difform graphs: but no graph in the catalog is, for 
example, regularly oscillating like a sine wave. He even hints at the fact that the 
area of the graph of velocity is the distance traveled, the fundamental theorem of 
calculus, but to make his picture, the velocity cannot change sign: no backtracking. 
Oresme has gone beyond Euclid in a striking way but he cannot make the further 
leap of allowing negative values for an intensity. 


Luca Pacioli (1445-1517) 


Pacioli's importance is not due to his discoveries but to the fact that he wrote an 
encyclopedic work Summa de arithmetica, geometria, proportioni et proportion- 
alita which summarizes the contemporary knowledge of arithmetic, geometry and 
especially accounting. The work's greatest influence was due to its description of
double-entry bookkeeping which was a key step in the expansion of the international
business enterprises which characterized the Renaissance. Here we find a
small number of linear equations involving amounts of money whose solution is
negative. As in Leonardo, when the result is a negative number, it is described as
a debt. In one case, the price of an egg comes out negative: owning the egg puts
you in debt so the sellers are paying you to take their eggs.

Sesiano (op.cit.), however, tracked down one isolated instance of a problem in
Pacioli’s writings which is more exciting. There is an untitled manuscript, written
for his students in Perugia, which survives in the Vatican²¹. A standard class
of problems (going back to Babylonian times) involves dividing a number into
two parts which satisfy some quadratic condition. After solving some such problems
with positive solutions, he comes to what he calls the bellissimo caso. This
example asks you to divide 10 into two parts the difference of whose squares is
200. The reader may like to check that the answer is 10 = 15 + (−5). Here
is a problem not only in pure numbers one of which is negative but requiring
squaring this negative number. Although an obscure and forgotten footnote to history,
it seems that the young Pacioli ventured briefly into uncharted territory in
a truly original way. It is unfortunate that in his Summa, he did not pursue these
ideas.
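
The check is quick once one notes that $x^2 - y^2 = (x + y)(x - y)$, so the sum and the difference of squares together fix the difference $x - y$. The sketch below is a modern shortcut under our own naming, not Pacioli's procedure.

```python
from fractions import Fraction


def split_by_square_difference(total, square_diff):
    """Return (x, y) with x + y = total and x^2 - y^2 = square_diff."""
    diff = Fraction(square_diff, total)      # x - y, since (x + y)(x - y) = square_diff
    return (total + diff) / 2, (total - diff) / 2


x, y = split_by_square_difference(10, 200)
print(x, y, x * x - y * y)
```

The required difference x − y = 20 exceeds the sum 10, which is exactly why one of the two parts is forced to be negative.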


Girolamo Cardano (1501-1576)


The only reason to include Cardano is that he wrote the book Ars Magna²², so
we can analyze how he thought, how he looked on negative as well as imaginary
numbers. The solution of cubic equations was due to Scipione del Ferro, Professor
of Mathematics at Bologna around 1515, and the solution of the quartic
to Cardano's student Ludovico Ferrari. Cardano himself was an arrogant man, a
compulsive gambler, who led a wild life of ups and downs. That he computed
the odds of various sorts of gambling was arguably his greatest mathematical
achievement.

If Al-Khwarizmi had spun out the solutions for quadratic equations into many
different cases, Cardano really went to town describing how to solve 13 distinct
cases of the cubic equation (and 44 types of derivative cases). Why so many? Because
(a) the coefficients all had to be positive and (b) the equation had to equate a positive
quantity to another positive quantity. The many sections are entitled [...] like "On [...]

[...] he does explore a bit what algebra
can do for you if you admit negative roots. His example of a problem requiring
negative numbers is this: 


The dowry of Francis’ wife is 100 aurei more than Francis’ own property, and the square (?) 
of the dowry is 400 more than the square of his property. Find the dowry and the property. 


This works out to give Francis —48 aurei of property, that is, he is in debt 48 aurei, but 
fortunately is getting a dowry of 52 aurei. Here he correctly identifies the negative 
solution with a debt. This is an excellent illustration although squaring a sum of 
money is a pretty weird thing to do. 

There would be little else to say except for the curve ball that was thrown to Cardano:
for all cubic equations which have only one real root, del Ferro's formula worked
like a charm. But if there were three real roots (the other possibility, known as the
casus irreducibilis), it gave an apparently meaningless result. His formula for the
roots of the equation $x^3 + ax + b = 0$ is:


$$x = \sqrt[3]{-\frac{b}{2} + \sqrt{\frac{-D}{4 \cdot 27}}} + \sqrt[3]{-\frac{b}{2} - \sqrt{\frac{-D}{4 \cdot 27}}}, \quad \text{where } D = -4a^3 - 27b^2.$$


D, the discriminant, is equal to the product of the squares of the differences of all
pairs of distinct roots, hence it is positive if all the roots are real. So we need to find
a square root of a negative number even though in the end we only want the real number x.
Cardano struggled unsuccessfully with what this might possibly mean.
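
With complex arithmetic available, the "meaningless" expression can simply be evaluated. The sketch below uses Python's `cmath` module; the function name and the sample cubic $x^3 - 15x - 4 = 0$ (which has three real roots, one of them 4) are our own choices for illustration. The second cube root is taken as $v = -a/(3u)$ so that the product of the two cube roots is $-a/3$, the condition under which their sum solves the cubic.

```python
import cmath


def cardano_root(a, b):
    """One (possibly complex) root of x^3 + a*x + b = 0 via the formula above."""
    D = -4 * a ** 3 - 27 * b ** 2          # discriminant
    inner = cmath.sqrt(-D / 108)           # 108 = 4 * 27; purely imaginary when D > 0
    u_cubed = -b / 2 + inner
    if abs(u_cubed) < abs(-b / 2 - inner):
        u_cubed = -b / 2 - inner           # use the larger term to avoid dividing by zero
    if u_cubed == 0:
        return 0j                          # only happens when a = b = 0
    u = u_cubed ** (1 / 3)                 # principal complex cube root
    v = -a / (3 * u)                       # partner cube root, so that u * v = -a / 3
    return u + v


root = cardano_root(-15, -4)               # D = 13068 > 0: the casus irreducibilis
print(root)
```

The imaginary parts of the two cube roots cancel, up to rounding, and a real root emerges from the square root of a negative number.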

His one attempt to deal with these complex expressions is in the same Chap- 
ter, “On the rule for postulating a negative” mentioned above. Here he considers 
problems which have complex roots, such as the following: 


Divide 10 into two parts the product of which is 40. 


The usual quadratic formula gives the two parts as $5 + \sqrt{-15}$ and $5 - \sqrt{-15}$. This
is also the answer his math gives him and which he puts in writing in his book but 
he doesn't attribute much meaning to it. He makes his famous comment: 


So progresses arithmetic subtlety, the end of which, as is said, is as refined as it is useless. 


At the end of this Chapter, he gives a third type of example where he reasons
incorrectly with products of a real and an imaginary. In a later edition, he added
an appendix De aliza regula liber in which he flirted with the idea that maybe
$(-1)^2 = +1$ was wrong. Why not try $(-1)^2 = -1$? Between ‘fictitious’ negative
numbers and useless imaginaries, you get the sense that Cardano was at
sea.




134 David Mumford


Galileo (1564-1642)


Perhaps mathematicians were stuck thinking that negative numbers were fictitious
but surely physicists who were actually measuring things in the real world had a
clearer view? Arguably, Galileo's great contribution to physics was his recognition
that momentum was a key property of objects, that it was constant when no forces
were acting and that the force of gravity acting on projectiles and falling bodies
changed their momenta at a constant rate, not their positions. As an old man, when
the Pope commuted his sentence for heresy to house arrest, he wrote down these
theories in his Dialog concerning Two New Sciences. He starts off with his foil
Simplicio getting put down again and again by Galileo's mouthpiece Salviati. But
by the Fourth Day, Galileo lapsed into a more standard Euclid-style exposition
and puts out the centerpiece of his theory: the demonstration that a projectile
follows a parabolic arc under the force of gravity. Here was something he had
actually experimented with and he was on solid ground, theoretically as well as
experimentally. Figure 6 is an excerpt from his notebooks working on projectiles.

The central assertion in these dialogs is that gravity endows the projectile with
a constant downward acceleration. Thus its vertical velocity will be positive going
up, zero at the peak and negative coming back down. It is a linear function changing
from positive to negative. The math couldn't be simpler, if you are willing to use
negative numbers.
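
To make the point concrete, here is a minimal sketch of the signed description in modern terms. The values are assumptions chosen for round numbers: g is rounded to 10 m/s², the launch speed is 20 m/s, and upward is taken as positive; none of this is Galileo's own notation.

```python
G = 10.0  # gravitational acceleration in m/s^2, rounded for illustration (assumption)


def vertical_velocity(v0, t, g=G):
    """Signed vertical velocity: positive while rising, negative while falling."""
    return v0 - g * t


def height(v0, t, g=G):
    """Height above the launch point at time t."""
    return v0 * t - 0.5 * g * t * t


v0 = 20.0  # hypothetical upward launch speed in m/s
samples = [(t, vertical_velocity(v0, t), height(v0, t)) for t in (0.0, 1.0, 2.0, 3.0, 4.0)]
for t, v, h in samples:
    print(f"t = {t:.0f} s   velocity = {v:+.0f} m/s   height = {h:.0f} m")
```

One linear formula covers the climb, the apex and the fall; without negative velocities the same motion has to be cut at the apex into two separately treated halves.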
What does Galileo do? His main result is: 




Theorem 1. A projectile which is carried by a uniform horizontal motion com- 
pounded with a naturally accelerated vertical motion describes a path which is a 
semi-parabola. 


Note that he uses a semi-parabola: the half of the parabola in which height is
a monotone function of time. Considerably later, after a long discussion of the
time and distance of the semi-parabolic arc carrying the projectile to the ground,
[...] time without any discussion and concludes that the rising phase of a [...]


[...] in the book resemble the figure from his notes: a [...] auxiliary chords and
tangents. He analyzes the geometry [...] physics of a falling body and then asserts
without any [...] the direction of motion from a fall to a climb [...] at the apex from
positive to negative, is not [...]


[...] Two New Sciences, trans[...]
Figure 6. Galileo's notes on projectiles


Fermat (1601—1665) 




Fermat and Descartes, at essentially the same time, had the idea of introducing
coordinates into the plane and connecting geometric loci with polynomial equations
in two variables. Plane curves are not confined to the positive quadrant, so one might
expect that their logic would have pushed them to allow their variables to take on
both positive and negative values. But no! Their coordinates were only in a positive
quadrant and the other parts of a curve were treated separately if at all.

Below are two figures from Fermat's paper on the subject, Ad Locos Planos
et Solidos Isagoge (Introduction to Plane and Solid Loci). Incidentally, plane loci
meant lines and circles, solid loci meant the other conic sections, terminology which
dates from Greek times.
Figure 7. Two diagrams from Fermat’s Isagoge


In these figures N is the origin, NM is the x-axis (although Fermat used the letter
A, not x, because his variables were vowels), ND or NP is the y-axis (the letter E
for Fermat), x = NZ, y = ZI so I is the point with coordinates (x, y). On the left,
he is describing the locus of the equation:


$d^2 + x \cdot y = r \cdot x + s \cdot y$ (which he writes Dpl. + A in E aeq. R in A + S in E).


Here s = NO, r = ND and $d^2$ is a constant area, so we have a rectangular
hyperbola, centered at V, with asymptotes VO and VP. The curious point is that he
draws only this small part of the hyperbola, cutting it off on the x-axis. He also cuts
it off at the plotted point I. On the right, he is describing a parabola with equation:


$x^2 = d \cdot y$ (which he writes Aq. aequatur D in E)


Again, he cuts the locus off at his axes (and at I).

Descartes' treatment is similar, except that he does say in the text that there are
multiple orderings possible for the relevant points on the axes and that you must set
up different equations depending on the directions and ordering of both the variable
point and the constants in the construction. The goal is to make both sides of your
equation sums of positive quantities, just as in Al-Khwarizmi and Cardano’s work
[...] and cubic equations. Note that this is how Fermat's version of the [...]


What's so Baffling About Negative Numbers? 137 


algebra and coordinates, Newton was equally modern in his treatment of negative 
numbers, putting them on equal footing with positive numbers. So we should 
attribute the first clear European view of negative numbers to Wallis and Newton 
equally. 

In Chapter 16, Addition, Subduction, Multiplication and Extraction of Roots in 
Specious Arithmetic, Wallis defines negative numbers as nicely, simply and clearly 
as you could wish (here ‘Specious’ is Viete's term for arithmetic with variables 
given by letters): 


To these Notes, Symbols or Species are prefixed (as occasion requires) not only numeral
figures, but the signs + and — (or plus and minus), the former of which is a Note of Position, 
Affirmation or Addition; the other of Defect, Negation or Subduction: According as such 
Magnitude is supposed to be, or to be wanting. And where no such Sign is, it is presumed to 
be Affirmative and the sign + is understood. 


And accordingly these Signs are still to be interpreted as in a contrary signification. If
+ signify Upward, Forward, Gain, Increase, Above, Before, Addition, etc. then — is to be 
interpreted of Downward, Backward, Loss, Decrease, Below, Behind, Subduction, etc. And if 
+ be understood of these, then — is to be interpreted of the contrary. 


In this quote, the capitalization is his. With this understanding of negatives, how 
does he justify the rule for multiplying negatives? Here is what he says: 


For the true notion of Multiplication is this, to put the Multiplicand, or thing Multiplied 
(whatever it be) so often as are the Units in the Multiplier. ... and this, whatever the 
thing Multiplied, Positive or Negative: for there may well be a Double Deficit as a Double 
Magnitude; and —2A is as much the Double of -A as +2A is the Double of A... . 


But in case the Multiplier be a Deficit or Negative quantity; suppose —1; then instead of 
Putting the Multiplicand so many times, it will signify so many times to Take away the 
Multiplicand. ... so that + by — makes —; But to Multiply -A by —2 is twice to take away a 
Defect or Negative. Now to take away a Defect is the same as to supply it; and twice to take 
away the Defect of A is the same as twice to add A or to put 2A . . . : So that — by — (as well as 
+ by +) makes +. 
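

Wallis's argument is essentially an algorithm, and for whole-number multipliers it can be written down directly. The sketch below is our own paraphrase (the function name is hypothetical): a positive multiplier "puts" the multiplicand that many times, a negative multiplier "takes it away" that many times.

```python
def wallis_multiply(multiplicand, multiplier):
    """Multiply by an integer multiplier using only repeated putting and taking away."""
    result = 0
    for _ in range(abs(multiplier)):
        if multiplier > 0:
            result += multiplicand   # "put" the multiplicand once more
        else:
            result -= multiplicand   # "take away" the multiplicand once more
    return result


A = 3  # illustrative magnitude
for m1, m2 in [(A, 2), (-A, 2), (A, -2), (-A, -2)]:
    print(f"{m1:+d} by {m2:+d} makes {wallis_multiply(m1, m2):+d}")
```

Taking away a Defect twice is the last case: subtracting −A twice from nothing leaves +2A, which is the rule that − by − makes +.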


As far as I know, this is the first place in Western literature in which the rule of signs 
is not merely stated but explained so clearly. After this, when he gets to writing 
out the formulae for roots of equations, he no longer has to separate all these cases 
which we saw in Al-Khwarizmi and Cardano. For the quadratic he writes: 


Given the equation, $x^2 + 2bx = \pm c^2$
the roots are $x + b = \sqrt{\pm c^2 + b^2}$


(I have only changed his variable from a to x and noted squares by using e.g. $c^2$ for
his cc.) Note that he follows Euclid in making all terms homogeneous, so that, for
example, x, b, c can all be lengths and the equation relates an area to an area. For
this reason, he needs the symbol ± in front of the $c^2$.

Finally, Wallis gives what I believe is the first explicit use of the full number line,
positives to the right, negatives to the left, in Western literature:


Yet is it not that Supposition (of Negative Quantities) either Unuseful or Absurd when rightly
understood. And though, as to the bare Algebraick Notation, it import a Quantity less than
nothing: Yet, when it comes to a Physical Application, it denotes as Real a Quantity as if the
Sign were +; but to be interpreted in a contrary sense.


As for instance: Supposing a man to have advanced or moved forward (from A to B) 5 yards;
and then to retreat (from B to C) 2 yards; If it be asked, how much had he Advanced (upon
the whole march) when at C? I find ... he has Advanced 3 Yards. But if, having Advanced 5
Yards to B, he thence retreat 8 Yards to D; and it then be asked, How much is he Advanced
when at D, or how much Forwarder than when he was at A: I say −3 Yards. ... That is to say,
he is advanced 3 Yards less than nothing. ... (Which) is but what we should say (in ordinary
form of Speech), he is Retreated 3 Yards; or he wants 3 Yards of being so Forward as he was
at A.




Figure 8. Wallis’s illustration of the “number line” 


Newton, as one would expect, had a full command of negative numbers and all
[...]. He wrote lecture notes on arithmetic, algebra and geometry at some point,
[...] in his career. They were first published (without his approval) in [...]
translated into English with the title Universal Arithmetick. [...]


[...] either Affirmative or greater than nothing; or Negative, or less than nothing.
[...] stock may be called affirmative goods, and debts [...] progression may be called
affirmative motion, and [...] and the other diminishes the [...]


[...] numbers arise as [...]
pure numbers. He states the rule for the sign of the product simply as "... making the
product Affirmative if both factors are Affirmative or both Negative; and Negative if
otherwise." Unfortunately, he says nothing about why one should believe in this rule.

Whereas Fermat had given a systematic study of quadratic equations in two
variables showing that they all defined conic sections and Descartes had introduced
several cubic equations giving new curves (notably the “Cartesian parabola” and
his “Folium”), Newton went on to look at all possible cubics, in an article entitled
“Curves” in Lexicon Technicum by John Harris published in London in 1710. He
classified them into 72 types and sketched them. Without hesitation, he used all four
quadrants of the plane and plotted all roots (x, y), positive and negative. Here is an
example:


Figure 9. One of the 72 types of cubic curves plotted by Newton 


After Wallis and Newton's work, a modern arithmetic with negative numbers
was widely accepted in Continental Europe, where there was an explosion of
mathematical research during the Enlightenment. In England, curiously, the resistance
to negative numbers continued for some 150 years, culminating in De Morgan. A
long debate ensued between those who accepted them and those who did not, a story
which is beautifully described in Pycior's book that we have cited. In the end, De
Morgan and Hamilton founded the general theory of fields and negative reals took
their place in the greater world of complex numbers and quaternions.


5. Two Factors in the World View of 15th-17th Century Europe

I hope I have proven my point that Europe in the 16th and 17th centuries resisted
expanding its numbers to include negatives in a way which calls for some explanation.
China and India both seem to have moved naturally to this bigger domain of
numbers when the occasion presented itself. I want to make the case that the European
reticence was due to two factors. The first was the overwhelming importance
of Euclid in defining what is and what is not mathematics and the fact that negative
numbers had no place in Euclid's view of mathematics. The second is that, at the
time negative numbers should have been accepted, imaginary numbers cropped up
too and the idea arose that both negative and imaginary numbers had the same
twilight existence. It was because of negatives that square roots had a problem, so
maybe it was best to consider them both as second class citizens of the world of
numbers.

Euclid's Elements was written in the newly founded school/library at Alexandria
around 300 BCE and integrated the mathematical ideas of Theaetetus, Eudoxus and
many others in a systematic treatise. It is written in a monolithic theorem/proof
style not seen again in the History of Mathematics until the collective ‘Bourbaki’
composed their treatise in the 20th century. It was translated into Arabic in the 8th
century CE and from Arabic into Latin in the 12th century. As a result, it came to define
what mathematics is for every generation of Arabs and Europeans, arguably until
Newton and the Enlightenment when concepts with no roots in the Elements began
to take center stage.

But what is Euclidean mathematics? There are roughly three parts to the Elements:
Books I-VI on plane figures, Books VII-X on number theory and irrationals
and Books XI-XIII on three dimensional geometry. What numbers occur in the
Elements? Here's a list:


1. "magnitudes": the length of a line, the area of a plane figure and the volume
of a solid figure


2. positive integers, implicitly as in "The greater is a multiple of the less when
it is measured by the less" (definition 2, Book V) and explicitly as in "A number
is a multitude composed of units" (definition 2, Book VII). Note that [...]
is still always a length but, because he always has a "unit" around when
[...], it becomes in effect dimensionless. [...] volumes, etc. But "units"
[...] added to itself as many times as there are units in the other and thus some
number is produced".


3. Adding and multiplying ratios is the main goal in the extremely abstract Book
V, which is said to be the work of Eudoxus. Book V begins with defining
when two ratios are equal. For any ratio given by two lines A and B, he
considers which multiples satisfy nA > mB and which satisfy nA < mB.
Of course, this is the ‘cut’ Dedekind re-introduced in the 19th century to
construct real numbers from rationals. Here Eudoxus doesn't need to define
real numbers: they are ratios given by geometry. What he needs to do is to
define equality of ratios and he does this by requiring that their associated
cuts are the same. Addition and multiplication of ratios are both implicit in
that (a) if a line segment A is divided into two parts B and C then A : D
is going to be the sum of B : D and C : D and (b) A : C is going to be
the product of A : B and B : C. What is not at all clear is that addition
and multiplication are well-defined operations on the equivalence classes
called ratios. This is exactly what is asserted in Proposition 24, Book V (for
addition) and in Proposition 22, Book V (for multiplication) after a long
and subtle sequence of intermediate steps. One stands amazed at Eudoxus'
mathematical skills.
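
The equality test for ratios described in item 3 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the magnitudes are modelled here as positive integers (Euclid's are line segments), the names `compare` and `same_ratio` and the search bound `limit` are hypothetical, and a finite search of this kind can refute equality but cannot prove it in general.

```python
from itertools import product


def compare(n: int, a: int, m: int, b: int) -> int:
    """Return -1, 0 or 1 according to whether n*a is <, = or > m*b."""
    lhs, rhs = n * a, m * b
    return (lhs > rhs) - (lhs < rhs)


def same_ratio(a: int, b: int, c: int, d: int, limit: int = 50) -> bool:
    """Eudoxus-style test that a : b equals c : d.

    For every pair of multipliers (n, m) up to `limit` (an assumed bound),
    n*a compared with m*b must fall the same way as n*c compared with m*d.
    """
    return all(
        compare(n, a, m, b) == compare(n, c, m, d)
        for n, m in product(range(1, limit + 1), repeat=2)
    )


print(same_ratio(2, 3, 4, 6))  # the two ratios agree on every tested pair
print(same_ratio(2, 3, 3, 4))  # n = 3, m = 2 separates them: 6 = 6 but 9 > 8
```

The pair (n, m) that separates two unequal ratios plays the role of a rational number lying between them, which is why the construction is so close to Dedekind's.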


How about algebra, identities and formulas with the arithmetic operations? Euclid 
studies at length in Book II what people call ‘geometric algebra’, a series of propo- 
sitions which amount to algebraic identities such as 


$(a + x)^2 + (a - x)^2 = 2a^2 + 2x^2$


which is essentially the content of Proposition 9, Book II. Now what about the 
solutions of quadratic equations? This seems to be essentially what the lengthy and 
confusing Book X is all about. As Heath points out in his introduction to Book X, 
Euclid's classification of binomials and apotomes can be read as a systematic study 
of all the positive roots of all possible quadratic equations. This sets the stage for 
the separation of cases in treating roots of polynomial equations in all the works we 
have reviewed. 

All in all, if you are going to start with Euclid, you are not going to be predisposed
to introduce negative numbers into your calculations. He has gone to extraordinary
lengths to reduce arithmetic and algebra to geometry and thoroughly inoculate it
against negatives. It is worth looking briefly at what else was known at 300 BCE
which Euclid did not put in his book. There is apparently an unbroken tradition
starting in Babylon in 1800 BCE and continuing through Ptolemy of calculating
with the sexagesimal equivalent of decimals and approximating e.g. $\sqrt{2}$ and $\pi$ to
many sexagesimal places. Moreover, there was a tradition also going this far
back of solving quadratic equations by algorithms, described in words but exactly
equivalent to the quadratic formula. Euclid, in other words, distanced himself from
a rich numerical tradition and consciously, it would seem, purified his version of
mathematics.
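
To make "the sexagesimal equivalent of decimals" concrete, here is a minimal Python sketch that expands a square root in base 60. It is a modern illustration, not a reconstruction of any Babylonian procedure; the function name `sexagesimal_sqrt` and its parameters are hypothetical.

```python
from math import isqrt


def sexagesimal_sqrt(n: int, places: int) -> list[int]:
    """Digits of sqrt(n) in base 60, truncated after `places` fractional places.

    The first entry is the integer part; the rest are the sexagesimal places.
    """
    scaled = isqrt(n * 60 ** (2 * places))  # floor(sqrt(n) * 60**places)
    digits = []
    for _ in range(places):
        scaled, d = divmod(scaled, 60)
        digits.append(d)
    digits.append(scaled)                   # what remains is the integer part
    return digits[::-1]


print(sexagesimal_sqrt(2, 3))  # [1, 24, 51, 10]
```

Read as 1;24,51,10, that is 1 + 24/60 + 51/60² + 10/60³, which is approximately 1.414213.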

The Europeans, then, had the benefit of this shining example of pure math and
of the wonderful deductive logic on which they built. But it was hard to go beyond
it in any radical way, to model other phenomena in the real world which cried
out for negatives. Euclid was both the strength and the weakness of the European
mathematical world of the 16th and 17th centuries.

But I think there is a second factor behind the slow acceptance of negatives which
ought to be considered. As soon as one accepted $-1$, the algebra of the day thrust
upon you formulae requiring its square root and this was truly inexplicable. The
fates of $-1$ and $i$ were inseparable. Cardano's book makes this very clear. We have
already quoted from Chapter 37, near the end of his book, entitled On the Rule for
Postulating a Negative. The Chapter starts with the sentence:


This rule is threefold, for one either assumes a negative, or seeks a negative square root, or 
seeks what is not. 


He is essentially equating three follies, all problematical. That he later entertained
the idea that perhaps $(-1)^2$ ought to be equal to $-1$ shows how he viewed the
problems as intertwined. Harriot (1560-1621) also played with both possibilities,
as in the poem:


Yet lesse of lesse makes lesse or more, 
Use which is best keep both in store 


[...] will cancel at the end but not before. [...]
Viète in 1593 discovered that trisecting [...]
equation which belongs to the [...]
and he showed how to reduce the general casus irreducibilis to this special case.
Thus he reduced a famous unsolved algebraic problem to a famous geometric one,
unsolved in the sense that no ruler and compass construction was known (nor exists).
At the same time, Bombelli proposed that Cardano's formula could make sense if
you solved


$(x + \sqrt{-y})^3 = a + \sqrt{-b}$


So trisecting an angle was related to taking complex cube roots, but no one put these
together for a long time by finding the geometric meaning of complex numbers. Later
we have Wallis, knowing the geometric meaning of negatives as the left half line,
searching for a two dimensional geometric interpretation of imaginary numbers.
There was a big clue on the table if anyone had linked Viète's trisection with taking
cube roots of complex numbers. I believe it was Euler who finally worked out
complex exponentials and made the link between these two. Oddly enough, even
then Euler did not make explicit the geometric interpretation of complex numbers,
leaving this to Wessel, Gauss and Argand.

Finally, there is also the issue of a psychological explanation for avoiding negative
numbers. As Tversky and Kahneman have made popular, people are ‘loss averse’:
a loss of $x hurts more than a gain of $x pleases, and they do not act rationally using
mathematically correct expectations. The fear of loss is one of the themes in Ionesco's
bizarre play The Lesson, where a young woman comes for a tutoring lesson: she
can add with proficiency but cannot subtract. The mathematician doesn't come off
very rational either: he winds up killing her.

Mathematicians are attracted to Platonism, the belief that their discoveries
are all insights into the eternal true world of mathematical facts. This example,
the discovery of negative arithmetic and its incorporation into our numerical and
algebraic toolkit, shows us that we must not be too literal. Yes, negative numbers
were eventually accepted in the West as well as in China and India and all three
cultures made the same math out of them. But there can be huge differences between
cultures in the way mathematics unrolls. Euclid led the West down a certain path,
dominated for many centuries by geometric figures and constructions. Other cultures
were more practical and looked to solving concrete problems with approximate
numbers. I think the discovery of calculus is another instance of this split: in India,
studying the numerical table of sines led mathematicians to the idea of first and
second differences and the fundamental theorem of calculus. But that is another
story.


Kuttaka, Bhavana and Cakravala


Amartya Kumar Dutta 


Ancient Indian mathematical treatises contain ingenious methods for finding integer solutions of
indeterminate (or Diophantine) equations. The three greatest landmarks in this area are the kuttaka
method of Aryabhata for solving the linear indeterminate equation $ay - bx = c$, the bhāvanā
law of Brahmagupta, and the cakravāla algorithm described by Jayadeva and Bhaskara II for
solving the quadratic indeterminate equation $Dx^2 + 1 = y^2$. We shall briefly recall the history
of the above equations in ancient India and of their rediscovery in Europe, give an account of
the ancient Indian texts dealing with algebra in general and the above indeterminate equations in
particular, and mention a few works on history of mathematics which have highlighted ancient
Indian algebra, especially indeterminate equations. We shall then discuss various mathematical
aspects of the kuttaka, bhāvanā and cakravāla from the viewpoint of modern algebra and number
theory and the general cultural atmosphere in which the leading Indian mathematicians undertook
such explorations. We shall also examine the coverage of these results in texts involving history
of mathematics.


Introduction 


In a plenary talk at the International Congress of Mathematicians (1978), André
Weil¹ asserted ([We1, p 231-232]):


An understanding in depth of the mathematics of any given period is hardly ever to 
be achieved without knowledge extending far beyond its ostensible subject-matter. 
More often than not, what makes it interesting is precisely the early occurrence 
of concepts and methods destined to emerge only later into the conscious mind of 
mathematicians; the historian's task is to disengage them and trace their influence 
or lack of influence on subsequent developments. 


Indeed, one of the tendencies in recent research on history of mathematics has
been to discover in ancient works the equivalents of modern concepts and ideas
lying concealed in archaic language and style. For instance, I.G. Bashmakova's
innovative studies² on the work of Diophantus, with this approach, have contributed

¹ André Weil (1906-98), one of the giants of 20th century mathematics, noted for his outstanding
contributions in number theory and algebraic geometry, had a lifelong interest in the history of
mathematics.

² A retrospective of the life and work of Isabella G. Bashmakova was published in Historia Mathematica
8 (1981), p 389-392, on the occasion of her 60th birthday; a more detailed and updated account,

to the revival of interest in Diophantus and enhanced the understanding of his work.
However there is a dearth of literature which makes an analogous sophisticated
analysis of ancient Indian works on indeterminate equations³ (and other topics) in
the light of modern mathematics.

Although the basic facts regarding ancient Indian works on the indeterminate
equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ are well-known among historians of
Indian mathematics, not many mathematicians, even in India, are aware of these
works. Consequently, there is a general lack of awareness about the heights attained
in ancient Indian mathematics among scholars in other relevant disciplines (science,
history, Indology, etc) as well as among general readers. Further, in several accounts
on history of mathematics, one often finds misleading remarks about these works.
Some of the accounts also reveal inaccurate notions on general Indian history.


In view of the above observations, the present paper has been prepared with the 
following aims: 


(i) to present a mathematically non-technical history of the equations which
will be accessible to readers who are not necessarily familiar with advanced
mathematics;


(ii) to initiate a study on the significance of the kuttaka, bhāvanā and the
cakravāla in the light of modern algebra and number theory and to explore
their pedagogic potential;


(iii) to recall relevant features of the cultural history of India during the “Classical
Age”;


(iv) to discuss some issues regarding the presentation of Indian works on inde- 
terminate equations in texts on history of mathematics. 


Roughly, sections 1-4 correspond to (i); sections 5-7 to (ii), and section 8 to (iii).
Section 9 is specific to (iv); but the preceding sections, especially 3, 8 and parts of
5-7 are all relevant to (iv).

Section 1 gives a brief history of the methods for solving the equations
$ay - bx = c$ and $Dx^2 + 1 = y^2$ in ancient India and in Europe. Section 2
gives an account of ancient Indian texts which deal with the equations while section


³ An equation in more than one variable (or more generally, a system [...]
variables) is called indeterminate. The term is suggestive of [...]
many solutions. Diophantus of Alexandria (c. 250 CE) [...]
numbers) of indeterminate equations with integer [...]
with integer coefficients whose solutions are [...]

3 gives a partial account of the representation of Indian works on indeterminate
analysis in texts on history of mathematics. Section 4 presents the history of
development of certain topics of basic mathematics relevant to the kuttaka. Sections
5-7 make a mathematical discussion on the kuttaka, bhāvanā and cakravāla
respectively. Section 8 highlights some features of the intellectual life of post-Vedic
ancient India which are reflected in the legacy of Brahmagupta and other algebraists.
Section 9 makes a critical discussion on certain views regarding the motivation of
Brahmagupta and others in investigating the equation $Dx^2 + 1 = y^2$.

Readers interested in mathematical discussions may go directly to sections 5-7.
Readers preferring non-technical discussions can first look at 1-3, 8, 9 and parts
of 4. Parts of sections 5-7 (especially section 6) will also be accessible to general
readers; they can go through these sections by skipping a few sentences involving
mathematical technicalities.


1. A Brief History of The Problems
Indeterminate Analysis in Post-Vedic Ancient India


The mathematical treatises of the Classical Age in ancient India exude the gen- 
eral intellectual robustness of the era. Two outstanding works of the period are: 
Aryabhatiya of Aryabhata (499 CE) and Brahma Sphuta Siddhanta of Brahmagupta 
(628 CE). These texts had a decisive influence on the course taken by Indian mathe- 
matics (and astronomy) for the next 1000 years. In particular, they were responsible 
for the emergence of a flourishing algebra culture in Indian mathematics. 

Apart from developing fundamental concepts and results of symbolic algebra⁴,
the algebraists of the Classical Age revelled in the challenging problems of finding
integer solutions of indeterminate equations. The first explicit description of the
general integral solution of the linear Diophantine equation $ay - bx = c$ occurs
in the Aryabhatiya (499 CE) of Aryabhata. The algorithm is described in just two
cryptic stanzas (verses 32 and 33) at the end of the chapter Ganita (mathematics). The
solution was subsequently discussed, with variations and refinements, by several
Indian mathematicians including Bhaskara I (c. 600 CE)⁵, Brahmagupta (628 CE),
Mahavira (850 CE), Govindasvamin (c. 860 CE), Prthudakasvamin (c. 860 CE),


⁴ Algebra, as commonly understood, begins with the use of symbols (letters of the alphabet to denote
unknowns) and equations ([DS2, Preface]). This phase of the subject is sometimes called symbolic
algebra; the adjective “symbolic” is used to distinguish it from ancient geometric and arithmetical
works where algebraic knowledge and ideas are only implicit.

⁵ Different dates have been given for Bhaskara I like 522 CE (B. Datta), 574 CE (Sarasvati Amma)
and 629 CE (K.S. Shukla). Brahmagupta's ninth-century commentator Prthudakasvamin places
Brahmagupta (628 CE) to be later than Bhaskara I. We adopt 600 CE as a possible approximate date. See
[...ar, p 9].

Aryabhata II (950 CE), Sripati (1039 CE), Bhaskara II (1150 CE) and Narayana
(c. 1350 CE). The algorithm was termed kuttaka (pulverizer), derived from the root
kutt (to crush, to grind); the equation itself (or the problem of finding integer solutions
of the equation) was also referred to as kuttaka. The kuttaka principle was considered
so important that the subject algebra itself was initially called kuttaka-ganita, or
simply, kuttaka. Bhaskara I gives 30 examples in his commentary Aryabhatiya-bhasya
to illustrate the method for solving linear indeterminate equations (see
[Shu4, p 309-334] and [Ke, p 130-166] for the examples); examples are also given
in his astronomy treatises Maha-Bhaskariya (see [Shu2, p 29-46]) and Laghu-Bhaskariya
(see [Shu3, p 99-114]). The Mahasiddhanta of Aryabhata II contains
a separate chapter (18) called kuttaka exclusively on the solution to the linear
Diophantine equation. Bhaskara II discusses the kuttaka in two of his treatises,
Lilavati ([PNS, chapter 33, p 167-175]) and Bijaganita ([Ab, chapter 5, p 17-21];
[P, chapter 5, p 49-69]). Devaraja wrote a separate treatise Kuttakara-siromani
exclusively on the topic ([DS2, p 88]).
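
For readers who like to see the equation $ay - bx = c$ solved mechanically, the following Python sketch uses the modern extended Euclidean algorithm. It is an assumption-laden stand-in: it is not a transcription of Aryabhata's verses or of any later variant of the kuttaka, it assumes a and b are positive integers, and the function names `extended_gcd` and `solve_linear` are hypothetical.

```python
from typing import Optional


def extended_gcd(a: int, b: int) -> tuple[int, int, int]:
    """Return (g, s, t) with g = gcd(a, b) and a*s + b*t == g."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    return old_r, old_s, old_t


def solve_linear(a: int, b: int, c: int) -> Optional[tuple[int, int]]:
    """Return (x, y) with a*y - b*x == c and x the least non-negative value,
    or None when no integer solution exists. Assumes a > 0 and b > 0."""
    g, s, t = extended_gcd(a, b)            # a*s + b*t == g
    if c % g != 0:
        return None
    y0, x0 = s * (c // g), -t * (c // g)    # one particular solution
    # Every solution has the form x = x0 + k*(a/g), y = y0 + k*(b/g).
    step_x, step_y = a // g, b // g
    k = -(x0 // step_x)                     # moves x into 0 .. step_x - 1
    return x0 + k * step_x, y0 + k * step_y


print(solve_linear(7, 5, 1))   # (4, 3), since 7*3 - 5*4 == 1
print(solve_linear(6, 4, 1))   # None: gcd(6, 4) = 2 does not divide 1
```

The repeated division inside `extended_gcd` is the step that the name "pulverizer" evokes: the coefficients are ground down to smaller and smaller remainders until the problem becomes trivial.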

Having successfully dealt with the linear Diophantine equation, Indian algebraists
took up the harder problem of investigating the quadratic indeterminate
equation $Dx^2 + m = y^2$, called varga-prakrti (square-natured) in ancient India.
They laid special emphasis on solving the important case $Dx^2 + 1 = y^2$, where D is
a positive integer which is not a perfect square.⁶ The first major breakthrough in this
problem was achieved by Brahmagupta in 628 CE through a brilliant innovation:
a law of composition called bhavana. This composition principle of Brahmagupta
is of paramount significance in modern algebra and number theory.
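
The composition law can be illustrated with a short Python sketch. The formula used is the identity usually attributed to Brahmagupta, written in modern notation (the passage above names the law but does not state it): if $Dx_1^2 + m_1 = y_1^2$ and $Dx_2^2 + m_2 = y_2^2$, then $x = x_1y_2 + x_2y_1$, $y = Dx_1x_2 + y_1y_2$ satisfy $Dx^2 + m_1m_2 = y^2$. The function names `bhavana` and `holds` are hypothetical, and the route to D = 92 shown here is one possible illustration rather than a quotation from the text.

```python
from fractions import Fraction


def bhavana(D, first, second):
    """Compose triples (x, y, m), each satisfying D*x**2 + m == y**2."""
    x1, y1, m1 = first
    x2, y2, m2 = second
    return x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2


def holds(D, triple):
    """Check that the triple (x, y, m) satisfies D*x**2 + m == y**2."""
    x, y, m = triple
    return D * x * x + m == y * y


# D = 83: composing the solution (9, 82) with itself gives a new solution.
print(bhavana(83, (9, 82, 1), (9, 82, 1)))      # (1476, 13447, 1)

# D = 92: start from the auxiliary triple 92*1**2 + 8 == 10**2.
aux = (1, 10, 8)
squared = bhavana(92, aux, aux)                  # (20, 192, 64)
# Dividing x and y by 8 divides m by 64, leaving a rational solution for m = 1.
rational = (Fraction(squared[0], 8), Fraction(squared[1], 8), 1)
integral = bhavana(92, rational, rational)
print(integral, holds(92, integral))
```

Because composing a solution for m = 1 with itself always yields another solution for m = 1, a single integer solution generates infinitely many.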

Brahmagupta applied his composition rule to generate an infinite number of
integer solutions from a given integer solution of $Dx^2 + 1 = y^2$ and also showed
how to arrive at a given integer solution in a wide variety of cases (that is, for various
values of D like D = 83 or D = 92). While he had thus given a partial solution to the
problem, another Indian algebraist subsequently discovered the complete integer
solution by a cyclic method called cakravala (cakra: wheel or disc) in ancient India.
The discovery took place sometime during the 7th-11th century; for the cakravala
algorithm has been described by Jayadeva (who lived prior to 1073 CE)⁷ and
the famous astronomer-mathematician Bhaskara II (1150 CE). However, the true
originator is not known (see next section). Bhaskara II illustrated the cakravala with


⁶ If D is negative, or if D is a square of a positive integer, then $Dx^2 + 1 = y^2$ has only finitely many
integer solutions; in fact, it is easy to see that $(0, \pm 1)$ are the only solutions in all these cases except
$D = -1$, and $(\pm 1, 0)$, $(0, \pm 1)$ are the only solutions when $D = -1$. It is when $\sqrt{D}$ is an irrational real
number that the problem becomes mathematically interesting, and it turns out that the equation will have
infinitely many solutions.

⁷ Jayadeva's verses on the solution of the indeterminate equation $Dx^2 + 1 = y^2$ have been quoted in the
text Sundari of Udaydivakara composed in 1073 CE. Nothing is known so far about this mathematician
Jayadeva. He is not to be confused with the Vaisnava poet Jayadeva of the 12th century who composed
Gita-Govinda.

difficult numerical cases like D = 61 and D = 67. For the equation $61x^2 + 1 = y^2$,
the smallest solution in positive integers is x = 226153980, y = 1766319049,
indicating the unexpected intricacy of the problem.⁸ Narayana (c. 1350 CE) too
discussed solutions of the equation $Dx^2 + 1 = y^2$, illustrated the method with
the cases D = 97 and D = 103, and showed how to use the solutions to give
rational approximations to $\sqrt{D}$ (see section 7). Besides the equations $ay - bx = c$
and $Dx^2 + m = y^2$, several other indeterminate and simultaneous indeterminate
equations also occur in the treatises of some of the above mathematicians, especially
Bhaskara II.
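
The cyclic method can be sketched in Python as follows. This follows the usual modern formulation of the cakravala, which is an assumption here because the passage above does not state the rule: one keeps a triple with $y^2 - Dx^2 = k$, picks m so that k divides $y + xm$ and $|m^2 - D|$ is as small as possible, and replaces the triple until k = 1. The function names `choose_m` and `cakravala` are hypothetical.

```python
from math import isqrt


def choose_m(x: int, y: int, k: int, D: int) -> int:
    """Pick m > 0 with k | (y + x*m) that makes |m*m - D| as small as possible."""
    kk = abs(k)
    # x and k stay coprime throughout, so some residue class modulo |k| works.
    m0 = next(m for m in range(1, kk + 1) if (y + x * m) % kk == 0)
    r = isqrt(D)
    low = r - ((r - m0) % kk)          # largest admissible m not exceeding sqrt(D)
    candidates = [m for m in (low, low + kk) if m > 0]
    return min(candidates, key=lambda m: abs(m * m - D))


def cakravala(D: int) -> tuple[int, int]:
    """Return positive (x, y) with D*x**2 + 1 == y**2 for a non-square D > 1."""
    y = isqrt(D)
    if y * y == D:
        raise ValueError("D must not be a perfect square")
    if (y + 1) ** 2 - D < D - y * y:   # start from the square nearest to D
        y += 1
    x, k = 1, y * y - D                # invariant: y*y - D*x*x == k
    while k != 1:
        m = choose_m(x, y, k, D)
        x, y, k = (y + x * m) // abs(k), (y * m + D * x) // abs(k), (m * m - D) // k
    return x, y


print(cakravala(61))   # (226153980, 1766319049)
print(cakravala(67))   # (5967, 48842)
```

The ratio y/x of any solution is a rational approximation to $\sqrt{D}$; for D = 61 the fraction 1766319049/226153980 differs from $\sqrt{61}$ by less than $1/x^2$.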

The equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ are important equations in
modern mathematics. But the Indian works on such indeterminate equations during
the 5th-12th centuries were too advanced to be appreciated or noticed by Arab and
Persian scholars and did not get transmitted to Europe during the medieval period.
For instance, when Fyzi translated Bhaskara II's treatise Lilavati⁹ into Persian, he
omitted the portion on indeterminate equations.

We now narrate some of the landmarks in the history of the two indeterminate
equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ in post-Renaissance Europe.


Indeterminate Analysis in Modern Europe 


Problems in indeterminate equations were taken up in Europe during the 17th 
century. These problems fascinated some of the best European mathematicians of 
the 17th-18th century who not only rediscovered solutions to the problems but also 
developed general theories and techniques in this connection. 

The integer solution of the linear Diophantine equation was described in Europe
for the first time by C.G. Bachet de Méziriac (1581-1638) more than eleven centuries
after Aryabhata. In the first edition (1612 CE) of his book Problèmes plaisants et
délectables qui se font par les nombres¹⁰, Bachet stated that if a and b are relatively
prime integers, then one can find a least integral multiple of a which exceeds an
integer multiple of b by a given integer c. The proof was given in the second edition
of the book (1624). Dickson mentions that Bachet “employed notations for 18
quantities, making it difficult to hold in mind the relations between them and so
obtain a true insight into his correct process” ([Di, p 44]). Bachet also mentioned


⁸ Contrast it with the minimum solution of $60x^2 + 1 = y^2$: it is x = 4, y = 31.

⁹ A popular text on arithmetic and mensuration with some algebra. It was translated by Fyzi from
Sanskrit into Persian, at the behest of Emperor Akbar, in 1587 CE.

¹⁰ Pleasant and delectable problems to be solved by numbers. Bachet had been interested in
mathematical recreations and puzzles. This book is a collection of such puzzles. In 1621, Bachet published
a bilingual edition (Greek original with Latin translation) of Diophantus' Arithmetica with extensive
commentary. This work was largely responsible for the resurgence of number theory in Europe. Weil
remarks ([We2, p 33]): “... his is the merit of having provided his successors and notably Fermat with
a reliable text of Diophantus along with a mathematically sound translation and commentary.”

his solution in his edition (1621) of Diophantus' Arithmetica emphasising that the
method is his own ([We2, p 7]). After Bachet's work, several mathematicians in
Europe continued to discuss various approaches to the equation $ay - bx = c$. Some
of the early names include J. Kersey (1673), M. Rolle (1690), T.F. de Lagny (1697),
L. Euler (1735), N. Saunderson (1740), W. Emerson (1764), J.L. Lagrange (1767),
J. Bernoulli (1772) among many others; see [Di, Ch II]. As late as in 1798 (nearly
two centuries after Bachet), the great Legendre wrote in the preface of his Theory
of Numbers (quoted in translation in [G, p 3]):


Bachet,..., solved the indeterminate equation of the first degree by a general and very 
ingenious method. 


Pierre de Fermat (1601-65), who is regarded as the father of Modern Number
Theory, tried to create enthusiasm for the subject among his contemporary
mathematicians like John Wallis (1616-1703) and Lord William Brouncker (1620-84)
of England, B. Frenicle de Bessy (c. 1612-75) of France and F. van Schooten¹¹
(1615-60) of Netherlands. Fermat, who had carefully read Bachet's version of
Diophantus, laid special emphasis on methods for finding integer solutions of
equations as distinct from the easier, though important, question of finding rational
solutions that occurs in Diophantus. One of the problems through which Fermat
chose to highlight the beauty and intricacy of number theory was the problem of
finding integer solutions to $Dx^2 + 1 = y^2$, a problem explored by Brahmagupta
in 628 CE and solved by an Indian algebraist prior to the 11th century. Among
the two specific examples that Fermat proposed to Frenicle in 1657 CE was the
case D = 61 which occurs in Bhaskara II's Bijaganita (1150 CE).¹² To get some
feel for the intensity of Fermat's involvement with number-theoretic methods,
especially the equation $Dx^2 + 1 = y^2$, we may cite excerpts from his letter (1657)
presenting his challenge to the English mathematicians (translation by Heath; quoted
in [G, p 17]):


There is hardly anyone who proposes purely arithmetical questions, hardly anyone who
understands them. Is this due to the fact that up to now arithmetic¹³ has been treated geometrically
rather than arithmetically? This has indeed been the case both in ancient and modern works; 
even Diophantus is an instance. For, although he has freed himself from geometry a little more 
than others have in that he confines his analysis to the consideration of rational numbers, . . . 


Now arithmetic has, so to speak, a special domain of its own, the theory of integral numbers. 
This was only lightly touched upon by Euclid in his Elements, and was not sufficiently 


¹¹ A Dutch mathematician who is most famous for popularising the Cartesian geometry of Descartes.

¹² The other case proposed by Fermat was D = 109. The smallest solution of $109x^2 + 1 = y^2$ is
given by x = 15140424455100, y = 158070671986249.

¹³ Here “arithmetic” refers to “higher arithmetic”, i.e., number theory, the study of abstract properties
of numbers. In Greece, usual computational arithmetic was called “logistic” while the term “arithmetic”
meant number theory.

studied by those who followed him ... ; arithmeticians have therefore now to develop it or
restore it.

To arithmeticians therefore, by way of lighting up the road to be followed, I propose the
following theorem to be proved or problem to be solved. If they succeed in discovering the
proof or solution, they will admit that questions of this kind are not inferior to the more
celebrated questions in geometry in respect of beauty, difficulty or method of proof.


Given any number whatever which is not a square, there are also given an infinite number of
squares such that, if the square is multiplied into the given number and unity is added to the
product, the result is a square.

Regarding Fermat's challenge, André Weil remarked ([We2, p 81-82]):


What would have been Fermat's astonishment if some missionary, just back from India, had
told him that his problem had been successfully tackled there by native mathematicians almost
six centuries earlier!


A general method for solving Fermat's problem was discovered by Brouncker in
1657.¹⁴ During 1657-58, there had been an exchange of letters among mathematicians
like Frenicle, Brouncker and Wallis who had taken interest in Fermat's problem.
These letters were published by Wallis in Commercium Epistolicum (1658).
In 1685, Wallis published his monumental work A treatise of algebra both historical
and practical (in short, Algebra) comprising 100 chapters. Chapter 98 of this
treatise was devoted to Fermat's problem and was based on the correspondence
published in Commercium Epistolicum. Brouncker's method for solving the equation
$Dx^2 + 1 = y^2$ was described in this chapter in a standard form. An enlarged second
edition of the treatise Algebra was published as the second volume of Wallis' Opera
Mathematica (1693).

Wallis had attempted to prove that $Dx^2 + 1 = y^2$ always has a positive integral
solution, but used an incorrect result ([Di, p 354]). Fermat had asserted in his
correspondence of 1659 that he had proved by his method of “descent” that the
equation $Dx^2 + 1 = y^2$ has infinitely many integer solutions (when D is a positive
integer which is not a perfect square). The proof has not been found in any of his
writings.

Fermat's problem was again taken up in the next century by the two greatest
figures of 18th century mathematics: L. Euler (1707–83) and J.L. Lagrange (1736–1813).
Euler became interested in the problem from around 1730 and Lagrange took
it up around 1768. They developed the theory of continued fractions, investigated
the problem in the framework of this theory and established that if D is not a perfect
square, then √D has an infinite but periodic continued fraction expansion, and all


¹⁴In response to a challenge posed by Frenicle, Brouncker was able to use his method to find the
(smallest) solution x = 1819380158564160, y = 32188120829134849 to the equation 313x² + 1 = y²,
which he claimed had taken him only "an hour or two" ([OR]).




152 Amartya Kumar Dutta 
solutions (p, q) to the equation Dx² + 1 = y² are given by certain convergents
of the expansion.¹⁵ Moreover, there exists a positive integral solution (x₁, y₁)
of Dx² + 1 = y² (called the fundamental solution) such that all positive integral
solutions are given by (xₙ, yₙ) where xₙ, yₙ are defined by the relation
yₙ + √D xₙ = (y₁ + √D x₁)ⁿ. While the initial discoveries were made by Euler, it was Lagrange
who first published formal proofs of all these results during 1768–69. He included
them in his Additions to Euler's Elements of algebra (composed in 1771, published
in 1773). Brouncker's method could be re-interpreted in the language of continued
fractions; in that presentation, it becomes equivalent to Euler's method. The precise
historical details are given in [We2, p 183; 229-233].
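The two results just stated (solutions occur among the convergents of the continued fraction
of √D, and every positive solution arises from the fundamental one by taking powers) can be
illustrated by a short Python sketch. It is only a modern illustration using the standard
recurrence for the continued fraction of a quadratic surd, not a transcription of the procedures
of Brouncker, Euler or Lagrange; the names pell_fundamental and pell_solutions are introduced
here purely for the example.

```python
import math


def pell_fundamental(D):
    """Return the smallest positive (x, y) with D*x*x + 1 == y*y (D not a square)."""
    a0 = math.isqrt(D)
    if a0 * a0 == D:
        raise ValueError("D must not be a perfect square")
    # State (m, d, a) of the continued fraction expansion of sqrt(D).
    m, d, a = 0, 1, a0
    # Successive convergents p/q of sqrt(D).
    p_prev, p = 1, a0
    q_prev, q = 0, 1
    while p * p - D * q * q != 1:
        m = d * a - m
        d = (D - m * m) // d
        a = (a0 + m) // d
        p_prev, p = p, a * p + p_prev
        q_prev, q = q, a * q + q_prev
    return q, p


def pell_solutions(D, count):
    """First `count` positive solutions, from y_n + sqrt(D) x_n = (y_1 + sqrt(D) x_1)^n."""
    x1, y1 = pell_fundamental(D)
    x, y = x1, y1
    out = []
    for _ in range(count):
        out.append((x, y))
        # Multiply (y + sqrt(D) x) by (y1 + sqrt(D) x1).
        x, y = x * y1 + y * x1, y * y1 + D * x * x1
    return out


if __name__ == "__main__":
    print(pell_solutions(2, 4))
    print(pell_fundamental(313))
```

For D = 2 the sketch lists (2, 3), (12, 17), (70, 99), (408, 577); for D = 313 it returns the pair
quoted in the footnote on Brouncker's computation.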


The Label “Pell's Equation”


Selberg mentioned in an interview¹⁶ that André Weil once said that if something
in mathematics gets attached to the name of a person, then the person in question
usually has very little to do with it. The equation Dx² + 1 = y² was attributed to
the English mathematician John Pell (c. 1611–85) by Euler although there is no
evidence that John Pell had seriously investigated the equation. Euler's confusion
could have been created through a cursory reading of Wallis' Algebra, a large part of
which was devoted to the work of five English mathematicians: Oughtred, Harriot,
Pell, Newton and Wallis himself. In any case, the name “Pell's equation” stuck. As
Weil put it ([We2, p 174]):


Pell's name occurs frequently in Wallis's Algebra, but never in connection with the equation
x² − Ny² = 1 to which his name, because of Euler's mistaken attribution, has remained
attached; since its traditional designation as “Pell's equation” is unambiguous and convenient,
we will go on using it, even though it is historically wrong.

[...]ted by R. Sridharan ([Sr2, p 17]), the equation should be called “Brah[...]” as a tribute
to the genius who first made a contribution to the [...]

In the Sulba-sutras (c. 800 BCE), the most ancient extant mathematics treatises, the fractions
7/5, 17/12 and 577/408 have been used as approximations for √2 ([D2, p 202]; [Di, p. 341];
[Sar, p 18]). These three fractions may be interpreted as arising out of solutions of
2x² ± 1 = y²; for 2 × 5² − 1 = 7², 2 × 12² + 1 = 17² and 2 × 408² + 1 = 577². In fact, they
are respectively the third, fourth and eighth convergents of the simple continued fraction
expansion of √2. We shall revisit these fractions in section 7.
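The statement about convergents can be checked mechanically. The following Python sketch is a
modern verification only (nothing of the kind appears in the Sulba-sutras or in the sources
cited); it assumes the expansion √2 = [1; 2, 2, 2, ...], and the function name
sqrt2_convergents is introduced here just for the illustration.

```python
from fractions import Fraction


def sqrt2_convergents(n):
    """Return the first n convergents of sqrt(2) = [1; 2, 2, 2, ...]."""
    p_prev, p = 1, 1
    q_prev, q = 0, 1
    out = [Fraction(p, q)]
    for _ in range(n - 1):
        # Every partial quotient after the first equals 2.
        p_prev, p = p, 2 * p + p_prev
        q_prev, q = q, 2 * q + q_prev
        out.append(Fraction(p, q))
    return out


if __name__ == "__main__":
    for i, c in enumerate(sqrt2_convergents(8), start=1):
        # Each convergent y/x satisfies y*y - 2*x*x = +1 or -1.
        print(i, c, c.numerator ** 2 - 2 * c.denominator ** 2)
```

The third, fourth and eighth entries printed are 7/5, 17/12 and 577/408, with the values −1, +1
and +1 respectively in the last column.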

Archimedes (287–212 BCE) gave the approximations 265/153 and 1351/780 for √3. In
his commentary on the work of Archimedes, Eutocius (c. 480–540 CE) mentions
the relations 265² − 3 × 153² = −2 and 1351² − 3 × 780² = 1 as a verification of
the validity of the approximations ([We2, p 16]).

In 1773, G.E. Lessing discovered an ancient Greek epigram which states a
problem (known as the “Cattle Problem”) which is believed to have been communicated
by Archimedes to Eratosthenes for the mathematicians of Alexandria.
In this problem one is required to find eight integers (the numbers of bulls and cows
of each of four colours) satisfying nine conditions, linear and quadratic. After some
algebraic manipulations, the cattle problem reduces to the problem of finding a
positive integral solution to Dx² + 1 = y² where D = 4729494. There is no
evidence that Archimedes had made this connection ([OR]) and it is thought extremely
unlikely that either Archimedes or any of his contemporaries had actually
solved the cattle problem. The problem was first solved by A. Amthor (1880 CE);
the smallest value of one of the variables in the cattle problem is a number
having 206545 digits (see [Di, p 342–345], [L] and [Wi, p 400–402] for further
details).


Cakravala Revisited


The Indian cakravala needs fewer steps than the later methods of Brouncker and 
Euler; however a formal justification for the cakravala method is less elegant. Some 
mathematicians therefore prefer to describe the Brouncker-Euler algorithm when 
writing on the cakravala for students (see, for instance, [Vr, p 28-35]). 

A formal proof that the cakravala method will always work (that is, terminate after
a finite number of steps) was given by Krishnaswamy Ayyangar in 1929 ([Kr1]).
Ayyangar related the method to the theory of a half-regular continued fraction¹⁷
which he developed using only the kind of elementary mathematics which would
have been known in India by the 5th century CE.

Another approach was given by C.O. Selenius (1960) who developed a generalised
continued fraction expansion, which he called “ideal expansion”, used it to
give an algorithm for solving the equation Dx² + 1 = y² in which the number of
steps is minimised, and showed that his “ideal expansion” method is equivalent to
the Indian cakravala ([Sel]).¹⁸


¹⁷See [Kr1], [Kr2], [Kr3] for Ayyangar's work and [MRW] for recent applications of Ayyangar's
theory of “nearest square continued fraction”.
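For readers who wish to experiment, the cyclic iteration can be sketched in Python in the form
in which it is commonly stated today: a triple (a, b, k) with a² − Db² = k is combined with
(m, 1, m² − D), where m is chosen so that a + bm is divisible by k and |m² − D| is as small as
possible. The sketch below is an illustration under stated assumptions and not a rendering of
any Sanskrit text or of the proofs of Ayyangar or Selenius: the starting triple (the integer
nearest to √D), the brute-force search for m, the tie-breaking rule and the function name
cakravala are all choices made here for the example.

```python
import math


def cakravala(D):
    """Return (x, y) with D*x*x + 1 == y*y for a non-square integer D > 1."""
    s = math.isqrt(D)
    if s * s == D:
        raise ValueError("D must not be a perfect square")
    # Starting triple (a, b, k) with a*a - D*b*b == k (assumed: a nearest to sqrt(D)).
    a = s if D - s * s <= (s + 1) ** 2 - D else s + 1
    b = 1
    k = a * a - D
    while k != 1:
        K = abs(k)
        # Residue class (mod K) of the multipliers m for which K divides a + b*m.
        r = next(m for m in range(K) if (a + b * m) % K == 0)
        # The two positive members of that class lying closest to sqrt(D).
        m_lo = s - ((s - r) % K)
        candidates = [m for m in (m_lo, m_lo + K) if m > 0]
        m = min(candidates, key=lambda c: abs(c * c - D))
        # Compose (a, b, k) with (m, 1, m*m - D), then divide through by K.
        a, b, k = (a * m + D * b) // K, (a + b * m) // K, (m * m - D) // k
    return b, a


if __name__ == "__main__":
    print(cakravala(61))
```

For D = 61 the iteration passes through the triples (8, 1, 3), (39, 5, −4), (164, 21, −5), ...
and ends with x = 226153980, y = 1766319049.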


Legacy of “Pell’s Equation" 


The study of indeterminate equations, especially the study of the so-called “Pell's
equation”, played an important role in the evolution of classical algebra in ancient
India and later in modern Europe.¹⁹

Pell's equation has had applications throughout history. Large integer solutions of
Dx² + 1 = y² have been used in ancient times to yield good rational approximations
to √D (see section 7). The solutions of the equation yield units in the domain of
integers of the quadratic field Q(√D). The equation is also closely related to the
study of binary quadratic forms.

The solution of Pell's equation is the main step in the solution of the general
quadratic Diophantine equation in two variables. It also played a role in the solution
(1970) of Hilbert's 10th problem on the non-existence of an algorithm for solving
arbitrary Diophantine equations.

Pell's equation continues to fascinate mathematicians even today. Research papers
and articles are being published on it in various contexts; two of the recent
papers ([Ha], [MRW]) are listed in the bibliography. An extensive bibliography on
the equation and on methods for solving it can be found in the 2002 paper ([Wi]) of
Hugh Williams. The algorithmic efficiency of various methods is discussed in the
article of H.W. Lenstra ([L]). An expository book ([Br]), exclusively on the topic,
has been published recently (2003). The author E.J. Barbeau remarks in the Preface:²⁰


Pell's equation seems to be an ideal topic to lead college students, as well as some talented
and motivated high school students, to a better appreciation of the power of mathematical [...]


[...] of solutions of the equation is a very active area of research in
number theory and computer science. While there does not exist any
[...] algorithm for solving Pell's equation, S. Hallgren has exhibited a
polynomial-time quantum algorithm for the equation ([Ha]). As Lenstra remarks
([L, p 192]):


The last word on algorithms for solving Pell's equation has not been spoken yet. 


2. Texts on Algebra in the Classical Age 


As mentioned earlier, the subject of algebra was developed during the Classical Age,
consciously, as a distinct and important branch of mathematics. The text Aryabhatiya
(499 CE) already contains algebraic ideas, like a term gulika (“shot”; possibly
“coloured shot”) for an unknown quantity and algebraic results like solutions of
linear and quadratic equations, formulae for arithmetic progression and other sums
of finite series and, of course, the integer solutions of linear Diophantine equations.
Bhaskara I (c. 600 CE) discusses the kuttaka (the linear Diophantine equation) in
detail in the first chapter (verses 41–52) of his astronomy treatise Maha-Bhaskariya
([Shu2, p 29–46]), apart from the discussions, with examples, in his commentary
Aryabhatiya-bhasya ([Ke, p 128–166]).

It is however the Brahma Sphuta Siddhanta (628 CE) of Brahmagupta which laid
a firm foundation for the science of algebra in India. This is the first known ancient 
Indian text containing a separate chapter on algebra. Chapter 18 of this volumi- 
nous treatise (comprising over 1000 verses in 24 chapters) is titled Kuttakadhyayah 
(= Kuttaka (algebra) + adhyayah (chapter)). Towards the beginning and the end 
of this chapter, Brahmagupta emphasises the importance (rather indispensability) 
of the subject and its charm. He systematically expounds the basics: describes 
the use of symbols for unknowns, defines zero as an integer in algebra, intro- 
duces negative numbers and prescribes the rules of arithmetic operations in the 
enlarged number system (including negative numbers and zero), discusses the 
arithmetic of surds, explains the principles of formation and manoeuvring of 
equations (plan of writing, arrangement, clearances, etc), gives methods for solv- 
ing quadratic equations and simultaneous linear equations in several unknowns 
(anticipating the Gaussian method of elimination), etc. A substantial portion of 
the chapter on algebra discusses indeterminate equations of the first and second 
degree. 

The impact of Brahmagupta on algebra can be seen from the fact that separate
monographs on the subject (usually titled Bijaganita²¹) were brought out
subsequently by prominent mathematicians like Sridhara (750 CE), Padmanabha
(date unknown but before Bhaskara II), Sripati (1039 CE), Bhaskara II (1150 CE),
Narayana (1350 CE) and later mathematicians. Unfortunately, the specialised algebra
texts of Sridhara, Padmanabha and Sripati, the predecessors of the great
Bhaskara II, have not been found.²²


²¹The subject of algebra gradually acquired the current Sanskrit term bijaganita. Bija means “seed”,
“root”, “element”, as well as “analysis”; thus bijaganita literally means the “mathematics with elements
(letters)” as well as “analytical mathematics”.

Apart from algebra texts, the general texts on mathematics, like the Ganita-sara-sangraha
(850 CE) of Mahavira, also contain several results on topics
in algebra like quadratic equations, summations of finite series (like arithmetic
progression (AP) and geometric progression (GP)), permutations and combinations,
and sometimes even the kuttaka. Some of the astronomer-mathematicians wrote
separate chapters on algebra in their general texts. Chapter 14 of the astronomy
treatise Siddhantasekhara of Sripati, entitled avyakta-ganita, is on algebra.

Discussions on the relatively advanced or specialised topics in algebra, like solutions
of quadratic indeterminate equations, were usually confined to the Bijaganita
texts. The Bijaganita of Bhaskara II contains, apart from basic results of algebra,
a lucid exposition of the kuttaka, bhavana and the cakravala, and mentions several
other indeterminate equations. Bhaskara II attributes the cakravala to ancient
authors without specifying any name. At the end of his treatise, Bhaskara II makes a
general acknowledgement of the prolific algebra works of Brahmagupta²³, Sridhara
and Padmanabha as his chief sources. There is therefore a strong possibility that the
lost treatises of Sridhara and/or Padmanabha too contained an account of the cakravala
algorithm. The history of cakravala became more mysterious with the discovery
(in 1954) of a work dated 1073 CE which quotes verses from a preceding algebraist
Jayadeva describing the cakravala algorithm (along with the bhavana). Curiously,
Jayadeva is not mentioned by Bhaskara II.

It is remarkable how early the leading Indian mathematicians had realised the
significance of algebra and how strongly they asserted and established the importance
of their newly-founded discipline. Brahmagupta declares at the outset of his
Kuttakadhyayah (chapter on algebra) that algebra is indispensable for solving most
problems, gives a list of topics in algebra, and declares that one who has mastered
those topics will be venerated as an acarya among the learned. And, towards the
end of the chapter, he celebrates the delight and glory of the subject through the
words ([Sha, Ch 18; verses 99–100]; [Du1, p 104]):


These questions are stated simply for delight. One may devise a thousand others, or may 
resolve the problems proposed by others, by the rules given here. 


As the sun eclipses the stars by his brilliance, so will an expert eclipse the glory of other 
astronomers in assemblies of people by the recital of algebraic problems, and still more by 
their solution. 


²²Bhaskara II refers to the extensive algebra works of Sridhara and Padmanabha ([Ab, p 52]; [P, p
196]); Munisvara (c. 1608 CE) mentions the Bijaganita of Sripati ([Sih, p 26]).

²³The verse of Bhaskara II ([Ab, p 52]; [P, p 196]) actually mentions Brahma. It is generally presumed
by historians that Bhaskara II was referring to Brahmagupta.
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In order to emphasise the importance, power, and profundity of algebra, the 
poet-mathematician Bhaskaracarya (Bhaskara II) begins his treatise Bijaganita 
with an Invocation involving an interesting “pun” on the words Samkhyah (the
Samkhya philosophers as well as the experts in samkhya, the science of numbers),
Satpurusa (the Self-Existent Being as well as the wise mathematician), bija
(root/cause as well as algebra) and vyakta (the manifested universe as well as the rev- 
elation of an unknown quantity). Thus, through the opening verse, Bhaskaracarya 
venerates the Unmanifested — the Self-Existent Being of the Samkhya philoso- 
phy — who is the originator of intelligence and the primal Cause of the known 
or manifested universe; and, through the very same words, Bhaskaracarya pays 
tribute to the wise mathematician who, using algebra, solves a problem (i.e., re- 
veals or manifests an unknown quantity). The importance of algebra is reiterated 
at the end of Bijaganita. Bhaskaracarya remarks that algebra is the essence of all 
mathematics, is full of virtues and free from defects; and that the cultivation of 
algebra will sharpen the intellect of children. He concludes with the exhortation 
“patha patha” (Learn it, learn it) for the development of intelligence ([Ab, p 7; 53]; 
[P, p 4; 198]). 

Another algebraist Narayana (1350) exalts the subject with the following tribute 
([DS2, p 5]): 


As out of Him is derived the entire universe, visible and endless, so out of algebra follows the 
whole of arithmetic with its endless varieties (of rules). Therefore, I always make obeisance 
to Siva and also to (avyakta-) ganita (algebra). 


3. Ancient Indian Algebra in History of Mathematics 


In this section, we give a brief account of the discovery of some of the above works 
by modern historians and of expositions on them in some of the important early 
texts on history of mathematics. 

English translations of ancient Indian mathematics texts, especially of
Bhaskara II, began appearing from the early 19th century. Bhaskara's Bijaganita (algebra)
was translated from Persian²⁴ into English by E. Strachey under the title Bija
Ganita or the Algebra of the Hindus (London, 1813). The Lilavati of Bhaskara was
translated from the original Sanskrit by John Taylor (Bombay, 1816).

²⁴Based on the Persian translation of 1634 CE by Ata Alla Rushdi.


The work during this period which, perhaps, had the greatest impact in generating
an awareness about ancient Indian mathematics was the translation by H.T. Colebrooke
titled Algebra, with Arithmetic and Mensuration, from the Sanscrit of Brahmegupta
and Bhascara (London; 1817). Colebrooke translated the texts Lilavati and
Bijaganita of Bhaskara II and the chapters Ganitadhyayah (Ch 12; arithmetic and
mensuration) and Kuttakadhyayah (Ch 18; algebra) of Brahma Sphuta Siddhanta
of Brahmagupta, along with selections from the reputed commentary on Brahma
Sphuta Siddhanta titled Vasanabhasya (c. 860 CE) by Caturveda Prthudakasvamin.
Colebrooke also wrote a long preface where he discussed some of the distinguishing
features of the texts. This preface was reproduced in Colebrooke's Miscellaneous
Essays (1837) and its contents thereby got the attention of a wide community of
serious scholars on India's cultural history. One of the features of Indian algebra
that Colebrooke highlighted was ([Co2, p 438]):


General methods for the solutions of indeterminate problems of first and second degrees, 
in which they went far, indeed, beyond Diophantus, and anticipated discoveries of modern 
algebraists. 


For some time during the 17th–19th centuries, Aryabhatiya had been unavailable
to scholars of Indology, though its existence and contents had been known through
other available texts. In 1817, Colebrooke laments ([Co2, p. 422]):


A long and diligent research in various parts of India, has, however, failed of recovering any
part of the Padmanabha vija (or Algebra of Padmanabha²⁵), and of the algebraic and other
works of Aryabhatta.


Although a few manuscripts of Aryabhatiya were available in parts of Kerala, Indologists
did not come across them for several decades till Bhau Daji²⁶ discovered
a manuscript in 1864. The text, with the commentary by Paramesvara, was then
published by H. Kern in 1874 at Leiden (Netherlands). Subsequently, there have
been several English translations of Aryabhatiya with commentaries by historians
of science like P.C. Sengupta from Calcutta (1927) and W.E. Clark from Chicago
[...]. In the 1500th birth anniversary year of Aryabhata (b. 476 CE), the Indian
National Science Academy (INSA) published the Aryabhatiya with English
[...] by K.S. Shukla and K.V. Sarma ([ShuS]), along with the com[...]
and Somesvara (edited by K.S. Shukla) and the commentary
[...]. Recently, A. Keller has prepared an English
[...] Bhaskara I's Aryabhatiya-bhasya which discusses the
[...] Aryabhatiya ([Ke]).


[...] of Mahāvīra was discovered in early [...]

The works of Bhaskara I were discovered by B. Datta²⁷ in 1930 and published by
K.S. Shukla during 1960–76 ([Shu2], [Shu3], [Shu4]).

In 1954, K.S. Shukla discovered the text Sundari (1073 CE) of Udayadivakara,
a commentary on the Laghu-Bhaskariya of Bhaskara I, and found that it contains
quotations from a hitherto unknown mathematician called Jayadeva on the bhavana
and the cakravala. He published the verses in [Shu1].

The book by Datta-Singh ([DS2]), published in 1938 and reprinted in 1962, 
remains the richest reference book on ancient Indian algebra. Much information 
on indeterminate analysis is recorded in the book. However there is not much 
discussion on the significance of the works from the viewpoint of modern algebra. 
The ideas of abstract modern algebra had not become commonplace among the 
general mathematical community in India in the 1920s and 30s when Datta did his 
pioneering work on history of mathematics. 

A few books published from the West in early 20th century had incorporated facts 
on ancient Indian mathematics with reasonable accuracy. L.E. Dickson's History 
of the Theory of Numbers Vol II (1920) contains considerable mention of Indian 
contributions to the study of indeterminate equations. In fact, in the Preface, he 
makes a special mention of Brahmagupta's work ([Di, p. xi]): 


It is a remarkable fact that the Hindu Brahmegupta²⁸ in the seventh century gave a tentative
method of solving ax² + c = y² in integers, which is a far more difficult problem than its
solution in rational numbers.


However, Dickson was not aware of the discovery of Aryabhatiya in 1864 and, like
Colebrooke, refers to it as a lost treatise ([Di, p 41]).

The book History of Mathematics by F. Cajori (1919) contains a short chapter
“The Hindus” where there is a brief but admiring reference to Indian works on
indeterminate equations ([Ca, p 94–98]):


... Indeterminate analysis was a subject to which the Hindu mind showed a happy adap- 
tation. We have seen that this very subject was a favorite with Diophantus, and that his 
ingenuity was almost inexhaustible in devising solutions for particular cases. But the glory 
of having invented general methods in this most subtle branch of mathematics belongs to 
the Indians. The Hindu indeterminate analysis differs from the Greek not only in method, 
but also in aim. The object of the former was to find all possible integral solutions. Greek 
analysis, on the other hand, demanded not necessarily integral, but simply rational answers. 


Diophantus was content with a single solution; the Hindus endeavored to find all solutions [...]

Remarkable is the Hindu solution of the quadratic equation cy² = ax² + b. With great
keenness of intellect they recognized in the special case y² = ax² + 1 a fundamental problem
in indeterminate quadratics. They solved it by the cyclic method. . . .

Doubtless this “cyclic method” constitutes the greatest invention in the theory of numbers
before the time of Lagrange.²⁹

Unfortunately, some of the most brilliant results in indeterminate analysis, found in Hindu
works, reached Europe too late to exert the influence they would have exerted, had they come
two or three centuries earlier.


²⁷Bibhutibhusan Datta (1888–1958), later Swami Vidyaranya, is a pioneer among historians of
Indian mathematics. His numerous papers contain a wealth of information and insights for anyone
seriously interested in the history of ancient Indian mathematics; unfortunately most of them are not
easily accessible. A D.Sc. in Applied Mathematics, Datta possessed the rare combination of professional
mathematical experience with deep knowledge of Sanskrit literature. Apart from his books on history of
Indian mathematics ([D2], [DS1], [DS2]), he has also written a few books on Indian philosophy.

²⁸The effect of Colebrooke on Dickson can be seen in the spelling of Brahmagupta.


While several pioneering scholars like Colebrooke, Hankel and Thibaut attempted
objective presentations of different portions of ancient Indian mathematics,
there were other influential writers, steeped in Eurocentric views, who sought
to undermine it. The authors who were prejudiced in their portrayal of ancient
Indian mathematics did not even bother to be accurate or careful in conforming
to known facts. For instance, any work on the Pellian equation having some sophistication
was believed to be of Greek origin; and many authors who favoured
the hypothesis of Greek origin did not have clear ideas regarding the distinction
between Brahmagupta's brilliant but partial work and the subsequent complete
cakravala algorithm (which is a strong indicator that the research was indigenous).
Two errors commonly occurred: there were some who had the impression that the
final cakravala method also occurs in Brahmagupta's work; and there have been
writers who thought that Bhaskara II did not go beyond Brahmagupta's partial
work. We refrain from giving details in this article. While there has been a recog-
[...] the bias of certain writers, their warped views and erroneous remarks
[...] an influence on several subsequent authors and result in a distorted repre[...]

[...] with passage of time. But in the subsequent period there
[...] scholarship on ancient Indian mathematics
[...] always available easily. Let us consider the
[...] ancient texts relevant to the present article.
[...] has long been out-of-print. As far [...]
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were last published in 1966 ([Sha]), with Hindi translation. Not all chapters (not 
even all the mathematics chapters) of Brahmagupta's magnum opus have English
translations. Colebrooke's partial translations ([Co1]), though valuable, are not
always lucid. Jayadeva's verses occur only in the 1954 paper ([Shu1]). The picture
is somewhat better with respect to the mathematics texts of Bhaskara II. A well-known
publisher has brought out an English edition, with notes, of Lilavati ([PNS]),
based on the Marathi work of N.H. Phadke. Two useful recent editions of Bijaganita
are mentioned in the bibliography ([Ab]; [P]); however, neither is free from errors.
Publications on ancient commentaries of the original treatises are even more scarce. 
As M.D. Srinivas emphasises in [Srn], commentaries are no less important than 
the original treatises for a proper comprehension of the history of ancient Indian 
mathematics. 

Results like bhavana and cakravala have evoked deep appreciation from mathematicians
of high eminence, as can be glimpsed from the statements of Weil (sections
1, 5, 7) and Atiyah (section 9) or the tribute of Ojanguren (section 6). However, such
observations occur at diffused places (as incidental remarks in a sophisticated
book, in an isolated lecture, or in a highly specialised research monograph), far
removed from the attention of the general scholar. Going by some of the books on
history of mathematics in recent decades, it would appear that not all historians are
sufficiently aware of how highly the ancient Indian works are regarded by some of
the great mathematicians of modern times.³⁰


4. Long Division, GCD and Euclidean Algorithm 


Before commencing our discussion on the kuttaka, we shall make a brief mention of
the Euclidean algorithm which is a key step in the solution of the linear indeterminate
equation. We first discuss the antiquity of the method of “long division” in ancient
India.


Computational Arithmetic in Ancient India: Long Division 


Familiarity with the four fundamental operations of arithmetic is evident in Vedic
literature like the Satapatha Brahmana, the Taittiriya Samhita and even the
Rg-Veda. A passage from the ancient Satapatha Brahmana (quoted in [Sr2, p 5–6])
gives all divisors of 720. The Sulba-sutras show clear use of arithmetic operations
including division; they even give examples of division by fractions. We however
do not know about the actual techniques used in the Vedic era.


³⁰Many accounts, even by highly respected historians of mathematics, omit Indian contribution
altogether. To give two examples: The History of Mathematics: A Reader (1987) by J. Fauvel and J.
Gray has no representation of mathematics in India; the monograph The Beginnings and Evolution of
Algebra (2000) by I.G. Bashmakova and G.S. Smirnova has no mention of the algebra of post-Vedic
ancient India.

Now, although there was a long tradition of computational mathematics in ancient
India, a major difficulty in the reconstruction of this history is that the post-Vedic
mathematical works preceding Aryabhatiya (499 CE) are lost.³¹ Aryabhata mentions
at the beginning (verse 1) of the mathematics chapter Ganita of Aryabhatiya
that he is recording ancient knowledge “honoured at Kusumapura”. His commentator
Bhaskara I makes an incidental reference to names like Maskari, Purana,
Mudgala, Putana, referring to them as Acaryas (Masters). However nothing is
known to us about such revered mathematicians.³²

The gaps in our knowledge regarding the development of computational tech- 
niques in India may be illustrated by the history of “division”. A variant of the 
modern method of “long division" is described in the arithmetic treatises of 
Sridhara (750 CE), Mahavira (850 CE) and later writers (see [DS1, p 150—154] 
for details). But how old is their method of “long division"? Aryabhata's inge- 
nious algorithms for computing square roots and cube roots (which are again 
slight variants of the present methods) involve the technique of long division. 
However, neither Aryabhata (499 CE) nor Brahmagupta (628 CE) bothers to explain
any method of division separately. They take for granted the knowledge of
the long division method when they describe their algorithms for extraction of
square roots and cube roots. One can only conclude that the device had become
firmly established well before the 5th century and that the great Masters Aryabhata 
and Brahmagupta considered the method to be too elementary for their respective 
treatises. 

We mention here that the operation of division was regarded as a difficult op- 
eration by European scholars from ancient times till as late as the 15th century. 
Luca Pacioli remarked in 1494 CE: “if a man can divide well, everything else is 
easy, for all the rest is involved therein." ([Sm, p 132]). Not much is known about 
ancient European methods of division. The Indian version of long division was 
transmitted to the Arabs and occurs in Arabic works from the 9th century onwards. 
From the Arabs, the method went to Europe where it came to be known as the galley 
(galea, batello) method (also called “scratch method") which became very popular 
in Europe during the 15th–18th centuries. The method was very convenient in the
earlier system where erasing of figures was easy and desirable.³³ The paper-and-ink
system was less conducive for it and eventually led to the modern arrangement,
a modification of the galley method.


³¹Perhaps the original text of the Baksali manuscript could be anterior to Aryabhatiya. Its date is
controversial; some historians like B. Datta place it at around 200 CE.

³²Evidently they had composed voluminous mathematics texts; for Bhaskara I explains why
Aryabhata records only a “bit of mathematics” ([Sar, p 68–69]; [Shu4, p liv–lv]): “This has not been
composed as a treatise by acaryas Maskari, Purana, and others, with even one lakh verses for each
[topic]. How can the Acarya [Aryabhata] manage [to state all of] it within such a short treatise?”


Greatest Common Divisor (GCD) 


As the name suggests, the GCD of two positive integers is the largest integer which
divides both the numbers (without leaving any remainder). As we shall see in the next
section, the first major step in the kuttaka is the computation of the GCD of two
numbers. In the Aryabhatiya, the algorithm for computing GCD is alluded to in a
terse way. In fact, all expositors on kuttaka, even those who are generally more
elaborate, refer to the algorithm for computing GCD by repeated division without
much elaboration.

One can only conclude that, before the time of Aryabhata, ancient Indians had
become familiar with the technique of computing GCD, presumably by long division.
As is the case with most concepts and results occurring in Aryabhatiya, we lack
the precise knowledge regarding the genesis of the concept of GCD in ancient India.

Amidst all literature that is available to us, Euclid's Elements is the oldest text
which describes an algorithm for computing the GCD. However, there is no concrete
evidence that this algorithm was transmitted to Indian mathematicians. As one does
not see any influence of other results of the Elements in the mathematics of
Aryabhata and his successors (see the discussion in the second paragraph
of section 9), it is more likely that an algorithm for computing GCD evolved
independently in ancient India.


Reciprocal Subtraction 


Let a and b be the lengths of two line segments (that is, positive “real numbers” in modern
terminology). If $a > b$, then one subtracts b from a, then b from $a - b$ if $a - b > b$,
then b from $a - 2b$ if $a - 2b$ still exceeds b, and so on, till one reaches a stage, say
after $q_1$ subtractions, when the remainder $r_1 = a - q_1 b$ does not exceed b. Now, if
$b > r_1$, then, as before, one keeps on subtracting $r_1$ from b as often as possible, say
$q_2$ times, till one reaches the remainder $r_2 = b - q_2 r_1$, which is $\le r_1$. The subtractive
process is then continued with the lengths $r_1$ and $r_2$ to get $q_3, r_3$, then with $r_2$ and $r_3$,
and so on. This process was called Antanairesis (reciprocal subtraction) in ancient
Greek mathematics. The sequence (finite or infinite) of quantities $\{q_1, q_2, \ldots\}$ was

33 In ancient India, calculations (patiganita) were usually performed on sand spread on a board called
pati. In this format, the figures being big, not many lines of figures could be written on a board; but it
was easy to erase figures. Computational techniques, therefore, were planned in such a way that fewer
figures would be used at a time, and figures which are no longer required could be erased. See [DS1, p
129].
used to define the concept of Logos (ratio) in the theory of proportions developed
by Greek mathematicians preceding Eudoxus (400-349 BCE). For further details,
see [F] and [Vn, p 90-91].


Euclidean Algorithm 


Books VII, VIII and IX of Euclid’s Elements ([He]) describe the basic multiplicative 
properties of positive integers. As Weil remarks ([We2, p 4]):


It is generally agreed upon that much, if not all, of the content of those books is of earlier 
origin, but little can be said about the history behind them. 


Irrespective of the actual genesis of specific results, the Elements (c. 300 BCE) is 
the oldest extant book which makes a systematic exposition of the fundamentals of 
"Number Theory”. 

The Euclidean algorithm for finding GCD is described at the very beginning of
Book VII of the Elements (Propositions 1-2); and presented again, in a geometrical
form, in Book X (Propositions 2-3).³⁴ We briefly recall the method; for clarity,
we take advantage of modern language, especially the subscript notation. For a 
more detailed mathematical exposition of the Euclidean algorithm and associated 
concepts like GCD at an elementary level, the reader may refer to [JJ, Ch 1]. 

Note that, for integers, the principle of Antanairesis or mutual “subtraction”
may be abbreviated to the following principle of “division”: Given two positive
integers a, b with $a > b$, there exist unique non-negative integers q, r such that
$a = bq + r$ with $0 \le r < b$. This result is often called the “Division Algorithm”,
although it is a theoretical statement rather than a practical algorithm: it does
not describe any method to find q and r when a and b are given. Although Euclid
too uses the language of repeated mutual subtraction in Propositions 1-2 of Book
VII ([He, Vol 2, p 296-299]), the above principle (of taking the remainder of
one number divided by another) is implicit in his proof. It is not known how
Greeks performed division during Euclid's time,³⁵ and how they implemented
the “Division Algorithm” for large numbers (by any method more efficient than
repeated subtraction). As mentioned earlier, during the Renaissance in Europe, the
modern method of “Long Division” evolved out of the corresponding ancient Indian
method.
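To make the contrast between the existence statement and an actual procedure concrete, here is a minimal Python sketch of the slowest possible implementation: division by repeated subtraction, in the spirit of Antanairesis. The function name `divide_by_subtraction` is our own illustrative choice; nothing here is claimed to reflect how Greek or Indian calculators actually worked.

```python
def divide_by_subtraction(a: int, b: int) -> tuple[int, int]:
    """Return (q, r) with a == b*q + r and 0 <= r < b, using only subtraction."""
    if a < 0 or b <= 0:
        raise ValueError("expects a >= 0 and b > 0")
    q, r = 0, a
    while r >= b:   # subtract b as often as possible
        r -= b
        q += 1
    return q, r


print(divide_by_subtraction(24, 18))   # (1, 6)
```

The loop runs q times, so the cost grows with the size of the quotient; long division reaches the same q and r digit by digit.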


34 In Book VII, the algorithm is formulated for integers; in Book X, it is formulated for lengths of line
segments. In the latter case, one defines the GCD, or rather its synonym the Greatest Common Measure
(GCM), of two lengths a and b to be the greatest length g such that a and b are both integer multiples of
g. The GCM will exist if and only if a and b are commensurable, that is, in our language, if and only if
$a/b$ is a rational number.

35 Smith writes ([Sm, p 133]): “We are quite ignorant as to the way in which the Greeks and Romans
performed the operation of division before the Christian Era.”
By the so-called Division Algorithm (or by Antanairesis), given two positive
integers a, b, one has a sequence of relations $a = a_1 b + r_1$, $0 \le r_1 < b$;
$b = a_2 r_1 + r_2$, $0 \le r_2 < r_1$; $r_1 = a_3 r_2 + r_3$, $0 \le r_3 < r_2$; and so on, till one
arrives at a stage where $r_{n+1} = 0$, that is, $r_{n-1} = a_{n+1} r_n$. The Euclidean algorithm
asserts that $r_n$ is the GCD of a and b. It is based on the principle that the GCD of
two numbers does not change if the smaller number is subtracted from the larger
number. For instance, the GCD of 24 and 18 is the same as the GCD of 6 (= 24 − 18)
and 18. Consequently, $\mathrm{GCD}(a, b) = \mathrm{GCD}(a - b, b) = \cdots = \mathrm{GCD}(a - a_1 b, b) =
\mathrm{GCD}(b, r_1) = \mathrm{GCD}(r_1, r_2) = \cdots = \mathrm{GCD}(r_{n-1}, r_n) = r_n$.
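The chain of relations above can be traced mechanically. The following Python sketch (function names are ours, chosen for illustration) records each relation "dividend = quotient × divisor + remainder" and also gives the purely subtractive form for comparison.

```python
def euclid(a: int, b: int):
    """Return (gcd, steps); each step is (dividend, quotient, divisor, remainder)."""
    steps = []
    while b != 0:
        q, r = divmod(a, b)          # one relation: a = q*b + r
        steps.append((a, q, b, r))
        a, b = b, r
    return a, steps


def gcd_by_subtraction(a: int, b: int) -> int:
    """Subtractive form for positive integers: replace the larger number by the difference."""
    while a != b:
        if a > b:
            a -= b
        else:
            b -= a
    return a


g, steps = euclid(24, 18)
print(g)       # 6
print(steps)   # [(24, 1, 18, 6), (18, 3, 6, 0)]
```

For 24 and 18 the last non-zero remainder is 6, in agreement with the example in the text.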

Although Elements is the earliest available text which gives a clear description 
of the algorithm, scholars of Greek mathematics believe that it was known much 
earlier, at least in its subtractive form. The Euclidean algorithm was known in China 
and in India. As explained earlier, in the absence of enough historical documents, it 
is not known whether it was discovered independently and when the algorithm was 
first introduced or discovered in these countries. 

The Euclidean algorithm is one of the oldest algorithms that is still in common
use. A crucial feature of the algorithm is that it finds the GCD efficiently without
requiring the determination of the prime factors of the numbers.³⁶ The number of
steps grows only logarithmically with the size of the integers. G. Lamé (1845) proved that the number of
division steps required by the Euclidean algorithm to compute the GCD of two 
given integers never exceeds five times the number of digits (in decimal notation) 
of the smaller integer ([Kn, p 343]). 
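Lamé's bound is easy to check experimentally. The sketch below, in Python, counts division steps and compares them with five times the number of decimal digits of the smaller integer; the helper names are our own, and the check over a small range is only an illustration, not a proof.

```python
def division_steps(a: int, b: int) -> int:
    """Number of division steps the Euclidean algorithm takes on positive integers a, b."""
    a, b = max(a, b), min(a, b)   # start with the larger number as dividend
    steps = 0
    while b != 0:
        a, b = b, a % b
        steps += 1
    return steps


def lame_bound(a: int, b: int) -> int:
    """Five times the number of decimal digits of the smaller integer."""
    return 5 * len(str(min(a, b)))


# Consecutive Fibonacci numbers are the classical slow case.
print(division_steps(89, 55), lame_bound(89, 55))   # 9 10
```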


5. Kuttaka 
Brevity in Indian Tradition 


The original mathematical treatises in ancient India (especially the earlier texts) 
were generally very brief. The verses were meant to indicate broad hints. There 
was emphasis on the use of a learner's own intellect for working out the complete
details.³⁷ Detailed expositions on the works of the Masters were orally transmitted
through the guru-sisya (mentor-disciple) link. Sometimes the commentaries give 
details. The approach of the ancient stalwarts can be seen from the following 


36 Factorisation of large integers is regarded as a difficult problem; in fact, many systems of modern
cryptography are based on the non-availability of means to determine factorisation of sufficiently large 
integers. 

37 As an extreme example, consider the brevity of Aryabhata’s instruction for constructing a sphere 
which would rotate uniformly at the rate of Earth’s rotation (Gola 22): "[Make] a wooden gola (globe) 
which is perfectly spherical and uniformly dense all around [but is] light in weight. Using mercury, oil 
and water, and applying one's own intellect, [make the globe] rotate [at the required rate] to keep pace 
with time." See [Du2, Resonance 11(3), p67]. 
remarks that Bhaskara II makes in the epilogue of his text Bijaganita ([Ab, p 53];
[P, p 196-197]):


The expanse of science is vast as the ocean, which a person of ordinary intellect finds too
formidable to cross. But what is the necessity of details for the intelligent student? A mild
instruction suffices; for it helps the intelligent student to develop the knowledge on his own.
Just as a drop of oil put in water, or a grain of secret confided to a villain, or a little act of
charity to the deserving, spreads automatically, likewise a quantum of knowledge, instilled
into an intelligent mind, grows and expands extensively by its own force.


This attitude is not much different from that of most modern mathematicians who 
too feel that a budding researcher eventually gains more insight from a terse text 
than from a clearly spelt-out text, from an obscure important paper than from a lucid 
one. (For further discussion, see [DS1, p 126]; [Du1, Remark 4; p 110-111].)

The original description of kuttaka by Aryabhata (see Appendix) is excruciatingly
brief. Historians of mathematics have interpreted his two verses based on
ancient commentaries; for instance, B. Datta has been guided by the explanation
of Bhaskara I, the earliest commentator on Aryabhatiya whose work is available
to us. The descriptions by Bhaskara I, Brahmagupta and several other writers are
quite similar, in essence. Aryabhata II observed some simplifying devices which
were emulated by later writers. In Bijaganita, Bhaskara II makes a neat presentation
of the main algorithm and the useful auxiliary principles and states a few
suggestive examples where one could implement the prescribed rules ([Ab, Ch 5];
[P, Ch 5]).

In this section, we shall summarise the various presentations of the
kuttaka by authors ranging from Bhaskara I to Bhaskara II. Details take up 54 pages
in the book of Datta-Singh ([DS2, p 87-140]). We shall try to highlight the key
ideas which would be particularly enriching for students of mathematics.


[...] had applications in astronomy, most Indian [...] problem of determining all
positive integral solutions [...] $= \pm c$, where a, b, c are positive integers.

[...] problem like $100y - 63x = \pm 90$ would be [...] from the Bijaganita (also
Lilavati) of Bhaskara II [...]
ksepa (interpolator), gunaka (multiplier) and phala or labdhi (quotient) respectively.
Bhaskara I arranges the equation in the form $y = \frac{bx + c}{a}$ or $x = \frac{ay + c}{b}$ so as to ensure
that the interpolator c comes with a positive sign. Almost all ancient Indian writers
observe, in some form, that the equation $ay - bx = c$ will have integral solutions
only if c is divisible by the GCD of a and b, called apavartana or apavarta.


Reductions 


The treatment of the kuttaka reveals a tendency by ancient Indians to employ
“reductions”,³⁸ an aspect which has pedagogic value for training budding mathematicians.
We shall give a few examples.

First, Bhaskara I, Brahmagupta, Aryabhata II, Sripati and Bhaskara II, among
others, explicitly state that all the coefficients should be divided by GCD(a, b)
(= GCD(a, b, c)), so that the coefficients in the reduced equation become relatively
prime, or to use ancient terminology, [mutually] drdha (firm or reduced), niccheda
(having no divisor), nirapavarta (irreducible). The first example $221y + 65 = 195x$
in the kuttaka section of the Bijaganita of Bhaskara II ([Ab, p 20]; [P, p 57]; also
[PNS, p 171]) is meant to illustrate this reduction. The equation $221y - 195x = -65$
reduces to $17y - 15x = -5$. It may be pointed out to students that, by this reduction,
not only do the coefficients become smaller (thereby simplifying computations),
but, more importantly, one is now better equipped to tackle the problem as one has
the advantage of the additional property of a and b being coprime, a property
which is potentially useful.

Another subtle reduction of Aryabhata II, applicable in case there is a common
factor between a and c or between b and c, further reduces the size of the coefficients
of x and y. Let $g_1 = \mathrm{GCD}(a, c)$, $a_1 = \frac{a}{g_1}$, $g_2 = \mathrm{GCD}(b, c)$ and $b_1 = \frac{b}{g_2}$. Then
Aryabhata II, and his successors like Bhaskara II, observe that the problem of
solving $ay - bx = \pm c$ reduces to the problem of solving $a_1 Y - b_1 X = \pm 1$: if
$(u, v)$ is an integral solution of the latter, then $\left(\frac{cu}{g_2}, \frac{cv}{g_1}\right)$ is an integral solution of the
original equation. For instance, the problem of solving $100y - 63x = \pm 90$ (cited
by Bhaskara II) reduces to solving $10Y - 7X = \pm 1$. Once one obtains the solution
$X = 3$, $Y = 2$ of the reduced equation $10Y - 7X = -1$ by the main algorithm
(in this case, one can also get it easily by inspection!), one immediately gets the
solution $x = 30$, $y = 18$ for the original equation $100y - 63x = -90$.
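The two reductions just described can be checked on the examples from the text. The Python sketch below is only an illustration under our reading of the rules: `reduce_by_gcd` divides through by GCD(a, b), and `lift_solution` carries a solution (u, v) of the reduced unit equation back to the original one, assuming a and b are already coprime. Both function names are hypothetical.

```python
from math import gcd


def reduce_by_gcd(a: int, b: int, c: int):
    """Divide a*y - b*x = c through by GCD(a, b); fail if c is not divisible by it."""
    g = gcd(a, b)
    if c % g != 0:
        raise ValueError("no integral solutions: GCD(a, b) does not divide c")
    return a // g, b // g, c // g


def lift_solution(a: int, b: int, c: int, u: int, v: int):
    """Given a1*v - b1*u = s (s = +1 or -1) with a1 = a/GCD(a, c), b1 = b/GCD(b, c),
    return (x, y) satisfying a*y - b*x = s*c. Assumes a, b coprime and c > 0."""
    g1, g2 = gcd(a, c), gcd(b, c)
    return c * u // g2, c * v // g1


print(reduce_by_gcd(221, 195, -65))      # (17, 15, -5)
x, y = lift_solution(100, 63, 90, 3, 2)  # (3, 2) solves 10Y - 7X = -1
print(x, y, 100 * y - 63 * x)            # 30 18 -90
```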

Second, the problem of finding all positive integral solutions is reduced to that
of finding one positive integral solution. Suppose that $(u, v)$ is a positive integral
solution of $ay - bx = \pm c$. From $(u, v)$, one first finds the minimum positive integral


38 It is now a standard trick in modern mathematics to be on the lookout for simplifications or
“reductions” by which efforts are made to transfer a problem to an equivalent but neater and possibly
more tractable problem where the underlying features become more transparent.
solution $(\alpha, \beta)$.³⁹ Dividing u and v by a and b respectively, we have $u = pa + r$
and $v = qb + s$ for some whole numbers p, q, r, s such that $0 \le r < a$ and
$0 \le s < b$. If $p = q$, then $(r, s)$ is clearly a solution of $ay - bx = c$; in fact, it
is the minimum positive integral solution (proof is easy). If $p \ne q$, then it can be
seen that $p < q$ when we consider the equation $ay - bx = c$; and $p > q$ when
we deal with $ay - bx = -c$. The minimum positive integral solutions in the two
cases are $(r, s + (q - p)b)$ and $(r + (p - q)a, s)$ respectively. The above rule for
arriving at the minimum solution has been explained lucidly by Aryabhata II but
is already implicit in Bhaskara I. If $(\alpha, \beta)$ is a minimum positive integral solution
of $ay - bx = \pm c$, then Bhaskara I and his successors describe the general positive
integral solution as $(\alpha + ta, \beta + tb)$ where t is a positive integer.
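The rule for the minimum solution amounts to removing min(p, q) copies of (a, b) from (u, v). A short Python sketch of this reading follows; the function names are ours, and pairs are written (x, y) as in the text.

```python
def minimal_solution(a: int, b: int, u: int, v: int):
    """From a solution (u, v) of a*y - b*x = const, with u, v >= 0, subtract
    t = min(u // a, v // b) copies of (a, b), as in the rule described above."""
    t = min(u // a, v // b)
    return u - t * a, v - t * b


def general_solutions(a: int, b: int, alpha: int, beta: int, count: int):
    """Yield the first `count` solutions (alpha + t*a, beta + t*b), t = 0, 1, 2, ..."""
    for t in range(count):
        yield alpha + t * a, beta + t * b


print(minimal_solution(10, 7, 27, 19))              # (7, 5)   since 10*19 - 7*27 = 1
print(minimal_solution(10, 7, 11, 14))              # (1, 7)   since 10*14 - 7*11 = 63
print(list(general_solutions(10, 7, 7, 5, 3)))      # [(7, 5), (17, 12), (27, 19)]
```

In the second call p = 1 and q = 2, so the case p < q of the rule applies and the answer is (r, s + (q − p)b) = (1, 7).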
Third, it is clearly enough to solve an equation of the type $ay - bx = \pm 1$;
for, if $av - bu = \pm 1$, then $a(cv) - b(cu) = \pm c$. Such equations were called
sthira-kuttaka (constant pulverizer). This simplification too was made by some of
the Indian mathematicians right from Bhaskara I. As we shall see towards the end of
this section, in problems of astronomy involving the equations $ay - bx = c$, the
conditions were often such that the coefficients a, b would be the same in several
equations but the interpolator c would vary. In such situations, working first with
the constant pulverizer and then modifying the solution according to the specific
problem would have been convenient.

Some of the Indian authors also showed that we can reduce to the case $a > b$. This
step was achieved through two different methods. The later method corresponds to
the modern approach: if $b > a$, think of the equation as $bx - ay = \mp c$ rather than
$ay - bx = \pm c$. But earlier writers like Bhaskara I, who already had to arrange the
equation $ay - bx = c$ so as to have only positive coefficients, used the following
device when $b > a$: Let $b = aq + b_1$ where $b_1 < a$. Then the original equation
transforms into the equation $a y_1 - b_1 x = c$ (where $y_1 = y - qx$) which is of
the desired form. If $(u, v)$ is a solution of $a y_1 - b_1 x = c$, then $(u, v + qu)$ is a
solution of $ay - bx = c$. We shall henceforth assume $a > b$ as it would facilitate [...]


[...] the crux of Aryabhata's algorithm using the interpretation of [...] successive
division, we have $a = a_1 b + r_1$, $b = a_2 r_1 + r_2$, $r_1 =$ [...] denote the number of steps
after which the process [...]

quantities $a_1, \ldots, a_n$ can be quickly determined by the method of “long division”
for computing the GCD of a and b.

Now, for solving $ay - bx = 1$, define quantities $x_{n+2}, x_{n+1}, x_n, \ldots$ by backward
induction as follows: define $x_{n+2}$ and $x_{n+1}$ to be whole numbers satisfying the
relation $r_{n-1} x_{n+2} - x_{n+1} = (-1)^n$. Thus, if n is odd, one can simply take $x_{n+2} = 0$
and $x_{n+1} = 1$; if n is even, one can take $x_{n+2} = 1$ and $x_{n+1} = r_{n-1} - 1$. Now
define $x_m$ ($n \ge m \ge 1$) by $x_m = a_m x_{m+1} + x_{m+2}$. For facilitating the computation
of $x_n, \ldots, x_2, x_1$, the Indians constructed convenient tables called valli. As will be
clear from subsequent discussions, the numbers $x_1, x_2$ satisfy $a x_2 - b x_1 = 1$; thus
$(x_1, x_2)$ is a solution of the equation $ay - bx = 1$.

As a simple example, consider $10y - 7x = 1$ (the reduced form of one of the
exercises set by Bhaskara II). Here

$a = 10$, $b = 7$; so that $a_1 = 1$, $r_1 = 3$; $a_2 = 2$, $r_2 = 1$.

The reverse algorithm yields

$x_4 = 1$; $x_3 = 3 - 1 = 2$; $x_2 (= a_2 x_3 + x_4) = 5$; $x_1 (= a_1 x_2 + x_3) = 7$.

Thus we obtain the solution $x = 7$, $y = 5$.
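The backward recursion can be sketched in a few lines of Python. This is our own rendering of the steps as described in this section, assuming a > b ≥ 1 are coprime and that n counts the divisions needed to reach remainder 1; the name `kuttaka_unit` is hypothetical, and the list of quotients plays the role of the valli.

```python
from math import gcd


def kuttaka_unit(a: int, b: int):
    """Return (x, y) with a*y - b*x = 1, for coprime a > b >= 1."""
    if not (a > b >= 1) or gcd(a, b) != 1:
        raise ValueError("expects coprime integers with a > b >= 1")
    quotients, rems = [], [a, b]          # rems holds a, b, r_1, ..., r_n
    while rems[-1] != 1:
        q, r = divmod(rems[-2], rems[-1])
        quotients.append(q)
        rems.append(r)
    n = len(quotients)                    # r_n = 1
    r_before_last = rems[-2]              # r_{n-1}
    # Seed values with r_{n-1} * x_{n+2} - x_{n+1} = (-1)^n
    if n % 2 == 1:
        x_hi, x_lo = 0, 1                 # (x_{n+2}, x_{n+1})
    else:
        x_hi, x_lo = 1, r_before_last - 1
    # Backward step: x_m = a_m * x_{m+1} + x_{m+2}, for m = n, ..., 1
    for a_m in reversed(quotients):
        x_hi, x_lo = x_lo, a_m * x_lo + x_hi
    return x_lo, x_hi                     # (x_1, x_2) = (x, y)


print(kuttaka_unit(10, 7))   # (7, 5)
```

For a = 10, b = 7 the intermediate values are exactly those of the worked example: 1, 2, 5, 7.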

Bhaskara I explains that one need not continue the repeated division till the stage
$r_n = 1$. One can terminate at an intermediate stage k if, having obtained $r_1, \ldots, r_k$,
one can observe, by inspection, positive integers u, v satisfying $r_{k-1} u - r_k v = (-1)^k$.
One can then define $x_{k+2} (= u)$, $x_{k+1} (= v)$, $\ldots$, $x_2 (= y)$, $x_1 (= x)$ recursively, as
before, and obtain a solution of the equation $ay - bx = 1$.

For solving $ay - bx = -1$, one considers numbers $x_{k+1}$ ($k \le n$) and $x_{k+2}$
satisfying $r_{k-1} x_{k+2} - r_k x_{k+1} = (-1)^{k+1}$; the rest is as above. Or, one could take
the approach of Bhaskara II: solve $ay - bx = 1$ and use the fact that if $(\alpha, \beta)$ is
the minimum positive integral solution of $ay - bx = 1$, then $(a - \alpha, b - \beta)$ is the
minimum positive integral solution of $ay - bx = -1$.

For instance, consider $10y - 7x = -1$. While $a_1, r_1, a_2, r_2 (= 1)$ are as before,
now the reverse algorithm yields

$x_4 = 0$, $x_3 = 1$ so that $x_2 (= a_2 x_3 + x_4) = 2$; $x_1 (= a_1 x_2 + x_3) = 3$;

and we obtain the solution $x = 3$, $y = 2$. Alternatively, having computed the
solution (7, 5) of the equation $10y - 7x = 1$, one can immediately deduce the
solution $x = 10 - 7 = 3$ and $y = 7 - 5 = 2$ for the equation $10y - 7x = -1$.
Brahmagupta, Bhaskara II and Narayana give similar rules for deriving integer
solutions of $ay + bx = c$.
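Bhaskara II's complement rule can be verified by brute force for small coefficients. The Python sketch below is only a check, not a reconstruction of any historical procedure; both helper names are ours, and we assume coprime a > b > 1.

```python
def complement(a: int, b: int, alpha: int, beta: int):
    """Map the minimum solution (alpha, beta) of a*y - b*x = 1 to (a - alpha, b - beta)."""
    return a - alpha, b - beta


def smallest_positive_solution(a: int, b: int, rhs: int):
    """Brute-force search for the least positive (x, y) with a*y - b*x = rhs."""
    for x in range(1, a + 1):
        num = b * x + rhs
        if num > 0 and num % a == 0:
            return x, num // a
    return None


print(smallest_positive_solution(10, 7, 1))    # (7, 5)
print(complement(10, 7, 7, 5))                 # (3, 2)
print(smallest_positive_solution(10, 7, -1))   # (3, 2)
```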

In section 4, we mentioned the efficiency of the Euclidean algorithm
for determination of the GCD. As can be seen from the above steps, the extension,
by kuttaka, to the solution of the linear indeterminate equation $ay - bx = c$, is
again an efficient algorithm.
The Descent-Like Idea


As indicated by Weil in [We2, p 93], there is a descent-like idea in the kuttaka
method for solving $ay - bx = \pm 1$. We may formulate it as follows (to fix our ideas,
let us take the RHS to be 1):


(1) Assume the existence of a positive integral solution $(x, y)$ to the equation
$aY - bX = 1$.


(2) Then transform this equation in successive steps into equations with smaller
and smaller coefficients and solutions to eventually arrive at an equation
$a'Y' - X' = \pm 1$ with an obvious solution $(x', y') = (a't \mp 1, t)$ for any t.


(3) Then work backwards from this obvious solution $(x', y')$ of the reduced
equation to determine the desired solution $(x, y)$ of the original equation.


To elaborate, assume that $(x, y)$ is a positive solution of the equation $aY - bX = 1$.
Recall that we are assuming $0 < b < a$ and a, b are coprime. If $a - b = 1$, then
(1, 1) is a solution of $aY - bX = 1$. So we may further assume that $a - b > 1$. Then
$x > y$. If b had been 1, we would have been through. This prompts the approach
(instructive for students): why not try to “break” (i.e., “pulverize”) the coefficients
a, b into smaller ones? Recall the relation $a = a_1 b + r_1$ with $0 < r_1 < b$. Each
component of the pair $(b, r_1)$ is smaller than that of the pair $(a, b)$. Now one tries to
transform the equation $aY - bX = 1$ to get an equation with coefficients b and $r_1$.
The relation $ay - bx = 1$ leads to the relation $(a_1 y - x) b + r_1 y = 1$.

Set $z := x - a_1 y$. Then clearly $0 < z < y$ and $(y, z)$ is a positive solution of
the equation $bZ - r_1 Y = -1$ which is again a linear equation of the original form;
but now the solution $(y, z)$ is co-ordinate-wise smaller than the original $(x, y)$ and,
moreover, the coefficients (of the new equation) too have become smaller since
$0 < b < a$ and $0 < r_1 < b$. The two pairs of solutions being linearly related, if one
can determine the smaller pair $(y, z)$, one can easily compute the original $(x, y)$.

Denote x, y, z by $x_1, x_2, x_3$ respectively. Proceeding as above (introducing
$x_4, x_5, \ldots$, etc.), since $r_n = 1$ (as per our earlier notation), one eventually arrives
at the easily solvable equation $r_{n-1} X_{n+2} - X_{n+1} = (-1)^n$. From this equation,
one works backwards to arrive at the solution $(x, y)$ of the original equation
$aY - bX = 1$.

For instance, in the example $10x_2 - 7x_1 = 1$, the kuttaka uses the fact that solving
$10x_2 - 7x_1 = 1$ is equivalent to solving $7x_3 - 3x_2 = -1$ (where $x_3 = x_1 - x_2$),
which, in turn, is equivalent to solving $3x_4 - x_3 = 1$ (where $x_4 = x_2 - 2x_3$); and
the latter has the obvious positive integral solution $x_3 = 2$, $x_4 = 1$.
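The chain of successively smaller equations in this example can be generated mechanically. The Python sketch below lists, for coprime a > b ≥ 1, the triples (larger coefficient, smaller coefficient, right-hand side) until the smaller coefficient becomes 1; the function name `descent_chain` is our own.

```python
from math import gcd


def descent_chain(a: int, b: int):
    """Equations hi*X' - lo*X = rhs obtained by repeatedly 'pulverizing' (a, b)."""
    if not (a > b >= 1) or gcd(a, b) != 1:
        raise ValueError("expects coprime integers with a > b >= 1")
    chain = []
    hi, lo, rhs = a, b, 1
    while True:
        chain.append((hi, lo, rhs))
        if lo == 1:                       # equation with an obvious solution
            break
        hi, lo, rhs = lo, hi % lo, -rhs   # coefficients shrink, sign alternates
    return chain


print(descent_chain(10, 7))   # [(10, 7, 1), (7, 3, -1), (3, 1, 1)]
```

With $x_1, \ldots, x_4 = 7, 5, 2, 1$ each equation in the printed chain is satisfied by consecutive pairs: 10·5 − 7·7 = 1, 7·2 − 3·5 = −1, 3·1 − 1·2 = 1.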

Thus, the main algorithm itself is an illustration of a non-trivial application of the
modern “reduction” principle: it transfers, through a sequence of steps, a somewhat
involved problem to a problem with an obvious solution. Since the process involves
the breaking up of the original data (both solutions and coefficients) into successively
smaller numbers by repeated division, the term kuttaka (pulverizer) is apt; in fact,
it gives a hint to the learner of the main idea behind the algorithm. Thus the kuttaka
(both the method as well as the term itself) resembles Fermat's celebrated principle
of descent. As Weil remarks ([We2, p 7]):


In later Sanskrit texts this became known as the kuttaka (= "pulverizer") method; a fitting 
name, recalling to our mind Fermat’s “infinite descent”. 


Euclidean Algorithm and Kuttaka 


There is a tendency among writers on history of mathematics to make remarks to the
effect that the solution to the linear Diophantine equation “is essentially identical
with” or “does not differ substantially from” or “is just a straightforward application
of” the “Euclidean algorithm” for finding the GCD, and thereby to undermine (often
inadvertently) the achievement involved in the discovery of the kuttaka. As we have
seen, there are two major steps or principles involved in the ancient Indian kuttaka,
as also in Bachet's solution of the linear indeterminate equation $ay - bx = c$.


I Computation of the GCD d of a and b. 
II Expression of d as a linear combination of a and b.


In modern college-algebra, the two steps may be viewed, respectively, as corresponding
to the algorithmic proofs of the following generalised abstract statements:
(I) “Z is a Euclidean Domain and in a Euclidean Domain, any two non-zero elements
have a GCD.” and (II) “Any Euclidean Domain is a PID.”⁴⁰

Step II does not occur in the Elements or in any extant ancient Greek text. Its 
explicit presence can be seen in ancient Indian texts; in Europe, it occurs for the 
first time in Bachet's work (1624). Modern mathematicians, well-trained in years 


40 A Euclidean Domain is an integral domain R in which the “Division Algorithm” holds for any two
elements a and b of R with $b \ne 0$; that is, there exists a (non-negative) integer-valued function N on R
with $N(0) = 0$ such that given $a, b (\ne 0) \in R$, there exist $q, r \in R$ such that $a = bq + r$ with either
$r = 0$ or $N(r) < N(b)$. A PID (Principal Ideal Domain) is an integral domain in which every ideal is
principal, that is, every ideal is of the form Rd for some $d \in R$.

The concept of GCD of two integers has the following generalisation in the case of an integral domain
R: d is said to be a GCD of two non-zero elements a and b of R if (i) $a = a_1 d$ and $b = b_1 d$ for some
$a_1, b_1 \in R$, and (ii) if $d' \in R$ is such that $a = a'd'$ and $b = b'd'$ for some $a', b' \in R$, then $d = d_1 d'$
for some $d_1 \in R$. As in Euclid's Elements (Proposition 2 of Book VII), the repeated application of
Division Algorithm shows that any two non-zero elements of a Euclidean Domain have a GCD and that
it can be computed algorithmically. By describing, again algorithmically, how to express the GCD as
a linear combination of a and b, Aryabhata's kuttaka shows that the ideal generated by two non-zero
elements a and b of a Euclidean Domain R is the principal ideal generated by their GCD, thereby giving
a constructive proof of the property that any Euclidean Domain is a PID (the converse is not true). For
further details on Euclidean Domains and PIDs, see chapter 8 of Abstract Algebra by D.S. Dummit and
R.M. Foote (Wiley 2003).





172 Amartya Kumar Dutta 


of high-school classical algebra, sometimes think of Step II as a trivial corollary to
Step I. But to appreciate Step II in the context of its history in India, one should
note that it was discovered before algebra had formally established itself in Indian
mathematics and that at the time of discovery and for a few centuries thereafter, it was
considered a significant feat by some of the strong ancient Indian mathematicians.
Even Brahmagupta's treatise places considerable emphasis on the kuttaka and
provides plenty of illustrative examples. All of them touch Step I lightly; they dwell
mainly on Step II. One has also to note that, after Bachet's rediscovery (1612) of the
kuttaka, Legendre described the method as very ingenious as late as 1798.⁴¹

A profound aspect of the kuttaka method for solving Step II is that it contains
the seeds of the idea of “descent”. In fact, Bachet's rediscovery of the method might
have, in a subtle way, influenced the discovery of the “descent” principle. While
analyzing Brouncker's solution to the equation $Dx^2 + 1 = y^2$, Weil brings out
the implicit “descent”-like idea in Brouncker's method and remarks that Bachet's
method must have provided a model for Fermat and Brouncker. We quote below
excerpts from the passage of Weil ([We2, p 93]); we have highlighted a portion in
italics.


The starting point here is to assume the existence of a solution $(x, y)$ of $x^2 - Ny^2 = 1$, and
to transform the problem by successive steps into others, each of which has a solution in
smaller numbers; . . . this is typical of Fermat's descent; the difference lies in the fact that in
the latter case one seeks to prove that there is no solution, while in the former the purpose is
to find one. At the same time, Bachet's method for the solution of the equation $ay - bx = \pm 1$
undoubtedly provided a model for Wallis, Brouncker and Fermat to follow, . . .


Weil then revisits Bachet's solution of the linear indeterminate equation, bringing
out its descent-like aspect. It may be noted that the discovery of the Indian cakravala
(which also has a descent-like aspect to it) had taken place in an environment where
the kuttaka method had a strong influence; in fact, the kuttaka algorithm itself was
used in the solution (as we shall see in section 7).

The above considerations suggest that the kuttaka is perhaps more subtle than
what its description as being “essentially the Euclidean algorithm” would indicate.
Step II is of interest in computational number theory and is a fundamental step
in cryptography. The equation $d = ay - bx$, expressing the GCD d as a linear

41 Again, one has to consider the historical context: around Bachet's time symbolic algebra had just
begun to establish itself in Europe. Even F. Viète (1540-1603), who had a pioneering role in introducing
symbolic algebra in Europe, did not consider negative numbers. In fact, till the 16th century, Arab and
European mathematicians had struggled even with problems involving equations of the type $ax + b = c$
(where a, b, c are positive numbers) as can be seen from the prevalence of the cumbrous “rule of false
position”. As D.E. Smith remarks ([Sm, p 437]): “To the student of today, having a good symbolism
at his disposal, it seems impossible that the world should ever have been troubled by an equation like
$ax + b = 0$. Such, however, was the case, . . . ”
combination of integers a and b, is sometimes known as Bézout's identity ([JJ, p 7]);
perhaps Aryabhata's identity would be more appropriate historically.


Continued Fractions 


The kuttaka may be interpreted as a technique in the theory of continued fractions.
The formulation $y = \frac{bx + c}{a}$ and method of solution have given rise to a suggestion that
the pioneers of the kuttaka algorithm knew the basic principles of continued fractions
([Bg]). Knowledge of continued fractions is more apparent in some of the later Indian
works. In the versions described above, after obtaining the quotients $a_1, \ldots, a_n$,
one computes quantities $x_n, x_{n-1}, \ldots$ in the backward direction (i.e., one moves
from below to top in the table). But in certain later texts like the Karanapaddhati and
the Yuktibhasa (1540 CE) of Jyesthadeva, having obtained the quotients $a_1, \ldots, a_n$,
one constructs sequences of numbers $p_m$ and $q_m$ by the recurrence relations

$$p_0 = 1, \quad p_1 = a_1; \quad p_m = a_m p_{m-1} + p_{m-2}$$
$$q_0 = 0, \quad q_1 = 1; \quad q_m = a_m q_{m-1} + q_{m-2}$$

till one reaches $p_n$ and $q_n$ (where n is as before). Then $a q_n - b p_n = (-1)^{n+1}$, i.e.,
$(p_n, q_n)$ is a solution of one of the equations $ay - bx = \pm 1$ (the solutions of the other
equation can be derived using techniques discussed earlier). Note that the quantities
$a_1, a_2, \ldots, a_n, a_{n+1} (= r_{n-1})$ are the “quotients” in the continued fraction expansion
of $\frac{a}{b}$; and $\frac{p_m}{q_m}$ are the successive “convergents” (which are actually characterised by
the above recurrence relations). The solution of the linear Diophantine equation in
the notation of continued fractions was given by Saunderson in England in 1740
and Lagrange in France in 1767.
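The forward recurrence for the convergents is straightforward to run. The following Python sketch, under the same assumptions as before (coprime a > b ≥ 1, with n the number of divisions needed to reach remainder 1), builds $p_m$ and $q_m$ from the seeds stated above; the function name is hypothetical.

```python
from math import gcd


def convergent_solution(a: int, b: int):
    """Return (p_n, q_n, n) from the recurrences p_m = a_m*p_{m-1} + p_{m-2},
    q_m = a_m*q_{m-1} + q_{m-2}, with p_0 = 1, p_1 = a_1, q_0 = 0, q_1 = 1."""
    if not (a > b >= 1) or gcd(a, b) != 1:
        raise ValueError("expects coprime integers with a > b >= 1")
    quotients = []
    hi, lo = a, b
    while lo != 1:                        # stop when the remainder r_n = 1
        q, r = divmod(hi, lo)
        quotients.append(q)
        hi, lo = lo, r
    p, q = [1], [0]                       # p_0, q_0
    for m, a_m in enumerate(quotients, start=1):
        if m == 1:
            p.append(a_m)                 # p_1 = a_1
            q.append(1)                   # q_1 = 1
        else:
            p.append(a_m * p[-1] + p[-2])
            q.append(a_m * q[-1] + q[-2])
    return p[-1], q[-1], len(quotients)


p_n, q_n, n = convergent_solution(10, 7)
print(p_n, q_n, n, 10 * q_n - 7 * p_n)    # 3 2 2 -1
```

For a = 10, b = 7 this gives the pair (3, 2), the solution of $10y - 7x = -1$ found earlier by the backward method.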


Interpretation of Sathaye 


We have discussed the traditional Indian approaches to kuttaka due to Bhaskara I 
and later mathematicians. Avinash Sathaye has given an interesting and efficient 
version of Aryabhata’s algorithm that is lucidly explained, with an example, on his 
website ([Sat, chapter 3]; especially section 3.2). Sathaye's version can be presented 
as follows. 

We are to find x and y such that $ay - bx = d$, where d is the GCD of a and b. Let
$a_1, a_2, \ldots$ be the successive quotients and $r_1, r_2, \ldots$ be the successive remainders
in the standard division algorithm for computing the GCD of a and b. Thus, if
$a > b$, we have $r_1 = a - a_1 b$, $r_2 = b - a_2 r_1 = -a_2 a + (1 + a_1 a_2) b$, and so on.
Now set

$$u_0 := 0, \quad v_0 := 1, \quad u_1 := 1, \quad v_1 := -a_1.$$

Thus $a u_1 + b v_1 = r_1$. Define $u_n, v_n$ inductively by:

$$u_n = u_{n-2} - a_n u_{n-1}; \quad v_n = v_{n-2} - a_n v_{n-1}.$$

Then one can see that $r_m = a u_m + b v_m$ for each m; in particular, if d is reached at
stage n (i.e., if $r_n = d$), then $(-v_n, u_n)$ is a solution of $ay - bx = d$. In fact, at the next
stage, we get $a u_{n+1} + b v_{n+1} = 0$ (since $r_{n+1} = 0$) so that $(-v_n - t v_{n+1}, u_n + t u_{n+1})$
is a solution for any integer t.
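The version just described is, in modern terms, the extended Euclidean algorithm. A minimal Python sketch follows; the function name is ours, and the seed pair (1, 0) for the dividend a is our own assumption, added so that the first step of the loop reproduces $u_1 = 1$, $v_1 = -a_1$ from the same recurrence.

```python
def sathaye_kuttaka(a: int, b: int):
    """Return (x0, y0, d, dx, dy): a*y0 - b*x0 = d = GCD(a, b), and
    (x0 + t*dx, y0 + t*dy) is again a solution for every integer t."""
    r_prev, r_cur = a, b
    u_prev, v_prev = 1, 0        # assumed seed: a = a*1 + b*0
    u_cur, v_cur = 0, 1          # u_0, v_0 of the text: b = a*0 + b*1
    while r_cur != 0:
        q, r_next = divmod(r_prev, r_cur)
        # u_m = u_{m-2} - a_m*u_{m-1}, and likewise for v_m
        u_prev, u_cur = u_cur, u_prev - q * u_cur
        v_prev, v_cur = v_cur, v_prev - q * v_cur
        r_prev, r_cur = r_cur, r_next
    # Now r_prev = d = a*u_prev + b*v_prev, and a*u_cur + b*v_cur = 0
    return -v_prev, u_prev, r_prev, -v_cur, u_cur


x0, y0, d, dx, dy = sathaye_kuttaka(10, 7)
print(x0, y0, d)            # -3 -2 1
print(x0 + dx, y0 + dy)     # 7 5
```

As the text notes, the first solution produced may be negative; shifting by one step of (dx, dy) recovers the positive solution (7, 5) in this example.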

While the above algorithm fits the terse verses of Aryabhata, it requires the use
of negative numbers (otherwise, one has to make a careful adjustment of sign as
before). One is not sure if the use of negative numbers had become sufficiently
well-established (even among any elite section) during his time. But irrespective of
whether Aryabhata actually meant it, the above version would be a useful supplement
for pedagogic purposes.


Samslista Kuttaka


Ancient Indian algebraists also solved simultaneous linear indeterminate equations
of the type


$b_1 y - a_1 x_1 = c_1$, $b_2 y - a_2 x_2 = c_2$, $\ldots$, $b_r y - a_r x_r = c_r$,


under the obvious consistency conditions. Note that the special case
$b_1 = b_2 = \cdots = b_r = 1$ is the “Chinese Remainder” problem of finding an integer $y$ which when
divided by $a_1$ leaves a remainder $c_1$, when divided by $a_2$ leaves a remainder $c_2$, and
so on. In fact, Aryabhata's verses actually mention the problem of simultaneous


[...] (for $r = 2$). It is not clear from his brief verses whether Aryabhata
[...] only the case $b_1 = b_2 = 1$ or the more general case; the commentary on
[...] (non-residual pulverizer) for the
[...] simultaneous linear indeterminate equations; and the term sagra-[...]
for the equivalent problem of solving simultaneous [...]
by each $a_i$ ($1 \le i \le k$), leaves remainder $c_i$. An astronomy problem based on a
linear indeterminate equation was called graha-kuttakara (planetary pulverizer). 


An Application in Astronomy 


Ancient Indian astronomers conceived of long time-periods (yuga, kalpa, etc.) in
which the five visible planets along with the Sun and the Moon were estimated
to execute integral numbers of revolutions starting from a time when the above
celestial objects were “in conjunction”, that is, all were on the great circle of
the “celestial sphere” passing through the “vernal equinox” and the pole of the
“ecliptic”.³² For instance, by the estimates in Aryabhatiya ([ShuS, p 6]), a yuga
of 4320000 “sidereal years” has 1577917500 civil days and, during this period,
Saturn and Mars perform, respectively, 146564 and 2296824 revolutions. Ancient 
astronomers must have used the kuttaka for estimating the moments when the visible 
planets (and the Sun and the Moon) would be in conjunction. It is perhaps significant 
that Aryabhata, who begins his treatise by mentioning the time of commencement 
of the current yuga and the number of revolutions performed by each planet in a 
yuga, ends his mathematics chapter by describing the kuttaka. 

We end this section by giving one example of a typical application of the kuttaka 
in Indian astronomy texts. Let $D$ denote the number of civil days in a certain epoch
and $N$ the number of revolutions performed by a planet in that epoch. Let $y$ denote
the ahargana, the number of mean civil days elapsed since the beginning of the
specified epoch up to a given day; and let $x$ denote the bhagana, the number of
complete revolutions performed by a planet during this period. Note that $x$ is the
quotient obtained by dividing the product $yN$ by $D$. The kuttaka was applied to
determine the ahargana $y$ and the bhagana $x$ from the residue $r = yN - xD$ ($D$
and $N$ are known constants). For instance, in Laghu-Bhaskariya (chapter 8, verse
17), Bhaskara I gives certain algebraic conditions satisfied by the residues of Saturn
and Mars (on a given day) and asks the reader to calculate the ahargana (for that
day) and the bhagana of Saturn and Mars ([Shu3, p 99]). Since Saturn performs
146564 complete revolutions in 1577917500 days, on dividing the two integers by
the common factor 4, we get an intermediary epoch of 394479375 days in which
Saturn performs 36641 revolutions. Taking $D = 394479375$ and $N = 36641$,
one gets the sthira-kuttaka $36641Y - 394479375X = 1$ from which the solutions
for $y$ and $x$ can be obtained depending on $r$. In the case of Mars, one gets the
sthira-kuttaka $191402Y - 131493125X = 1$.
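To see how a solution of the sthira-kuttaka yields $y$ and $x$ for a given residue, one may argue as follows: if $NY - DX = 1$, then $y \equiv rY \pmod{D}$ satisfies $yN \equiv r \pmod{D}$, and $x = (yN - r)/D$. The Python sketch below illustrates this with the Saturn and Mars constants quoted above. It is a modern restatement, not the procedure of any particular text; the helper names and the sample residue 24 are our own assumptions.

```python
def inverse_mod(n, d):
    """Return Y in [0, d) with n*Y ≡ 1 (mod d), by the extended Euclidean algorithm."""
    r_prev, s_prev, r, s = d, 0, n % d, 1      # invariant: r ≡ s*n (mod d)
    while r != 0:
        quotient = r_prev // r
        r_prev, s_prev, r, s = r, s, r_prev - quotient * r, s_prev - quotient * s
    if r_prev != 1:
        raise ValueError("n and d are not coprime")
    return s_prev % d

def ahargana_from_residue(N, D, r):
    """Smallest y >= 0, with the matching x, such that y*N - x*D = r (N, D coprime)."""
    Y = inverse_mod(N, D)          # sthira-kuttaka: N*Y - D*X = 1
    y = (r * Y) % D
    x = (y * N - r) // D
    return y, x

N, D = 36641, 394479375            # Saturn: revolutions and civil days (from the text)
Y = inverse_mod(N, D)
X = (N * Y - 1) // D
print(N * Y - D * X)               # 1
y, x = ahargana_from_residue(N, D, 24)   # 24 is a made-up residue, for illustration only
print(y * N - x * D)               # 24
```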


³² An explanation of the above terms is given in [Du2, Part 1: The Khagola (The Celestial Sphere)
and Part 2: The Naksatra Dina (Sidereal Day)] and the significance of yuga in Indian astronomy has been
discussed in [Du2, Part 3: A Brief History].
6. Bhavana


The Main Principle


Brahmagupta’s presentation of results on quadratic indeterminate equations resembles
a typical modern arrangement where one first develops the fundamental theory
or principle that emerges out of the investigations into a problem, and then records
the solution to the original problem(s) as one application of the basic theory. Thus,
after announcing Atha Varga Prakrtih³³, Brahmagupta immediately states the following
result ([Sha, Ch 18; verses 64-65]) which is the cornerstone of the entire
section.
Theorem 1 (Brahmagupta's Bhavana).
If $y_i^2 = Dx_i^2 + m_i$, $i = 1, 2$, then $y = Dx_1x_2 \pm y_1y_2$, $x = x_1y_2 \pm x_2y_1$, $m = m_1m_2$
satisfy $y^2 = Dx^2 + m$.³⁴

The Appendix quotes the portion of Brahmagupta’s verses pertaining to the
above result. Theorem 1 is also stated in verse form by Acarya Jayadeva, Bhaskara
II (1150), Narayana (1350), Jnanaraja (1503) and Kamalakara (1658); a proof is
described in the Bijapallava (1548) of Krsna ([DS2, p 148-149]).

Let $D$ denote a fixed positive integer. For convenience, we shall use the notation
$(p, q; m)$ to denote a triple of numbers satisfying $Dp^2 + m = q^2$. Then, in modern
language, Theorem 1 states that the solution space of the equation $Dx^2 + m = y^2$
admits the binary operations


$(x_1, y_1; m_1) \odot (x_2, y_2; m_2) = (x_1y_2 + x_2y_1,\ Dx_1x_2 + y_1y_2;\ m_1m_2)$.
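The composition law is easy to experiment with. The following Python sketch (the names `bhavana` and `is_triple` are ours) composes two triples with either choice of sign in Theorem 1 and checks, over a small range of integers, that the result is again a triple for the same $D$.

```python
from itertools import product

def bhavana(t1, t2, D, sign=+1):
    """Compose (x1, y1; m1) and (x2, y2; m2), each satisfying D*x^2 + m = y^2."""
    (x1, y1, m1), (x2, y2, m2) = t1, t2
    return (x1 * y2 + sign * x2 * y1, D * x1 * x2 + sign * y1 * y2, m1 * m2)

def is_triple(t, D):
    """True if t = (x, y, m) satisfies D*x^2 + m = y^2."""
    x, y, m = t
    return D * x * x + m == y * y

D = 13
values = range(-3, 4)
ok = all(
    is_triple(bhavana((x1, y1, y1 * y1 - D * x1 * x1),
                      (x2, y2, y2 * y2 - D * x2 * x2), D, sign), D)
    for x1, y1, x2, y2 in product(values, repeat=4)
    for sign in (+1, -1)
)
print(ok)   # True
```

Every pair $(x, y)$ is a root pair for the interpolator $m = y^2 - Dx^2$, so the check runs over all small triples rather than over hand-picked ones.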
In the commentary on Brahma Sphuta Siddhanta by Prthudakasvamin (c. 860
CE), it is repeatedly emphasised, during the exposition of Brahmagupta's verses
stating Theorem 1, that the new roots $(x, y, m)$ have been obtained from the given
roots $(x_1, y_1, m_1)$, $(x_2, y_2, m_2)$ by composition ([Col, p 363-364, footnotes]). Just
before stating the bhavana formulae, Jayadeva mentions ([Shu1, p 5]) that the bhavana
operations pervade endless algorithms.³⁵ The importance attached to the bhavana
also comes out in the version of Theorem 1 due to Bhaskara II (see [Du1, p 111-112])
from which we quote an excerpt:


Set down successively the lesser root [$x_1$], the greater root [$y_1$] and the interpolator [$m_1$].
Place under them, the same or another [triple $x_2$, $y_2$, $m_2$], in the same order. From them, by
repeated applications of the bhavana, numerous roots can be sought. Therefore, the bhavana
is being expounded. ...


The perception of Theorem 1 as a law of composition (i.e., in terms of an
operation like $\odot$) comes out in the applications by Brahmagupta and others, in the
version of Theorem 1 due to Bhaskara II and, above all, in the choice and use of the
term bhavana.

In general, the Sanskrit word bhavana has several meanings including “production”
and “demonstration”; in the context of mathematics, bhavana means
“composition” or “combination”. A similar Sanskrit word bhavita (usual meaning:
“created”, “produced”; or “future”, “to be”) was adopted in Indian algebra
by Brahmagupta and others to denote the product of different unknown quantities.
The law $\odot$ (or x) was perceived as a special mathematical operation involving a
special multiplicative principle. In fact, Udaydivakara (1073 CE) explains bhavana
as multiplication ([Shu1, p 6]).

While the precise sense of bhavana, in the context of $\odot$ (or x), is that of a principle
of “composition” (a sort of generalised “product”), the term also carries the
additional suggestion of being a principle of “production” (i.e., “generation”) of
new roots, and of being a powerful “lemma” (i.e., “producer” of new results). The
descriptions of Theorem 1 by Brahmagupta and other Indian algebraists are invariably
followed by results crucially involving applications of the bhavana principle $\odot$. In
view of the importance of laws like $\odot$, an appropriate terminology was chosen
which would be rich in significance, simultaneously conveying the sense of a
multiplicative principle of composition, a principle of generation, a special lemma,
etc.

Theorem 1 can be formulated in the form of the identities


$(y_1^2 - Dx_1^2)(y_2^2 - Dx_2^2) = (Dx_1x_2 \pm y_1y_2)^2 - D(x_1y_2 \pm x_2y_1)^2$


which are sometimes called Brahmagupta's identities.


³⁵ In the light of modern developments, Jayadeva's statement has turned out to be prophetic.
Some Applications of the Bhavana by Brahmagupta


After stating Theorem 1, Brahmagupta gives useful offshoots. First, he indicates
([Sha, Ch 18, verses 64-65]) how to construct infinitely many rational solutions of
$Dx^2 + 1 = y^2$ (to be discussed in the next section). Next he points out ([Sha, Ch 18,
verse 66]) that infinitely many integer solutions of $Dx^2 + m = y^2$ can be generated
from a given solution (if it exists) and a solution of $Dx^2 + 1 = y^2$ (see below for
examples with $D = 13$); in particular, from one integer solution of $Dx^2 + 1 = y^2$,
one can generate infinitely many integer solutions ([Du1, p 90-93]). And then,
Brahmagupta gives his partial solution to the original problem ([Sha, Ch 18, verses
67-68]; [Du1, p 93-95]).


Theorem 2 (Brahmagupta).


(i) If $Dp^2 + 4 = q^2$, then $(\tfrac{1}{2}p(q^2 - 1), \tfrac{1}{2}q(q^2 - 3))$ is a solution of
$Dx^2 + 1 = y^2$.

(ii) If $Dp^2 - 4 = q^2$, and $r = (q^2 + 3)(q^2 + 1)$, then $(\tfrac{1}{2}pqr, (q^2 + 2)(\tfrac{1}{2}r - 1))$
is a solution of $Dx^2 + 1 = y^2$.


Let $(p, q; m)^n$ denote $(p, q; m) \odot (p, q; m) \odot \cdots \odot (p, q; m)$ ($n$ times). The results
(i) and (ii) can be obtained by considering the products $(p, q; 4)^3$ and $(p, q; -4)^6$
respectively and making necessary substitutions and simplifications. For instance,
for (i), using the relations $(p, q; 4) \odot (p, q; 4) = (2pq, Dp^2 + q^2; 4^2)$ and
$Dp^2 = q^2 - 4$, and cancelling the common factor 4 from the consequent identity
$D(2pq)^2 + 4^2 = (2q^2 - 4)^2$, one gets the triple $(pq, q^2 - 2; 4)$. Next, considering the composition
$(p, q; 4) \odot (pq, q^2 - 2; 4)$ and making adjustments as above, one would arrive at

$(\tfrac{1}{2}p(q^2 - 1), \tfrac{1}{2}q(q^2 - 3); 1)$.
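The two formulae of Theorem 2 can be checked numerically. The Python sketch below is only an illustration (the function names are ours); it uses exact rational arithmetic because formula (i) can produce half-integers when $q$ is even.

```python
from fractions import Fraction

def theorem2_i(D, p, q):
    """From D*p^2 + 4 = q^2, return (x, y) with D*x^2 + 1 = y^2."""
    assert D * p * p + 4 == q * q
    return Fraction(p * (q * q - 1), 2), Fraction(q * (q * q - 3), 2)

def theorem2_ii(D, p, q):
    """From D*p^2 - 4 = q^2, return (x, y) with D*x^2 + 1 = y^2."""
    assert D * p * p - 4 == q * q
    r = (q * q + 3) * (q * q + 1)
    return Fraction(p * q * r, 2), (q * q + 2) * (Fraction(r, 2) - 1)

for D, p, q, rule in [(5, 1, 3, theorem2_i), (12, 1, 4, theorem2_i),
                      (13, 1, 3, theorem2_ii), (61, 5, 39, theorem2_ii)]:
    x, y = rule(D, p, q)
    print(D, x, y, D * x * x + 1 == y * y)
```

For $D = 13$ and $D = 61$ the output reproduces the solutions (180, 649) and (226153980, 1766319049) that appear later in this chapter.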
[...] are illustrated by numerical examples which are usually in the spirit [...]


[...] Dividing the consequent [...]
triple $(\tfrac{5}{2}, 24; 1)$. Now


$(\tfrac{5}{2}, 24; 1) \odot (\tfrac{5}{2}, 24; 1) = (120, 1151; 1)$,


an integer triple. Thus (120, 1151) is a solution of $92x^2 + 1 = y^2$.

The thrill of Brahmagupta at his discovery can be felt from the phrase he used
after stating the equation $92x^2 + 1 = y^2$: kurvannavatsarad ganakah, “One who
can solve it within a year [is truly a] mathematician.”

Note that in this example $m = 8$ in the initial triple whereas Theorem 2 is a
statement for $m = \pm 4$. Through the example, Brahmagupta conveys to the reader
that, by clever algebraic manipulations, Theorem 1 can be made to cover various
cases where the initial triple is not necessarily of the form $(p, q; \pm 4)$.
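The two-step solution of $92x^2 + 1 = y^2$ can be replayed with exact fractions. In the Python sketch below, `samasa` and `scale` are our own helper names; the starting triple (1, 10; 8) comes from the identity $92 \times 1^2 + 8 = 10^2$.

```python
from fractions import Fraction

def samasa(t1, t2, D):
    """Samasa-bhavana of two triples (x, y, m) satisfying D*x^2 + m = y^2."""
    (x1, y1, m1), (x2, y2, m2) = t1, t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)

def scale(t, k):
    """Divide both roots by k and the interpolator by k^2; the result is still a triple."""
    x, y, m = t
    return (Fraction(x, k), Fraction(y, k), Fraction(m, k * k))

D = 92
t0 = (1, 10, 8)                      # 92*1^2 + 8 = 10^2
t1 = scale(samasa(t0, t0, D), 8)     # (20, 192; 64) becomes (5/2, 24; 1)
t2 = samasa(t1, t1, D)               # an integer triple again
print(t1, t2)
print(D * t2[0] ** 2 + 1 == t2[1] ** 2)   # True
```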

Another remarkable example of Brahmagupta ([Sha, Ch 18, verses 71-72]) is the
equation $83x^2 + 1 = y^2$. Applying Theorems 1 and 2 on the identity $83 \times 1^2 - 2 = 9^2$,
one gets the solution $(x, y) = (9, 82)$ from which one can generate a sequence
of successively larger solutions using Theorem 1; one of the solutions would be
$x = 175075291425879$, $y = 1595011813884802$ (see [OR]).

Through this example, Brahmagupta expects the reader to observe that since
$(\pm 2)^2 = 4$, successive applications of Theorems 1 and 2 on an integer identity
$Da^2 \pm 2 = b^2$ will give an integer solution to the equation $Dx^2 + 1 = y^2$. A later
mathematician Sripati clearly formulates ([Sih, p 40]; [DS2, p 157]) this immediate
consequence of Brahmagupta's results: From any positive integer solution of
$Dx^2 + m = y^2$, where $m \in \{-1, \pm 2, \pm 4\}$, one can derive a positive integer solution of
$Dx^2 + 1 = y^2$ by repeated use of the samasa-bhavana.⁴⁶

As an illustration of the principle of generating larger roots of $Dx^2 + m = y^2$
from a given root, Brahmagupta mentions the equations $13x^2 + 300 = y^2$ and
$13x^2 - 27 = y^2$ ([Sha, Ch 18, verse 75]). By inspection, one has solutions (10, 40)
and (6, 21) respectively. Also from the relation $13 \times 1^2 - 4 = 3^2$, one gets the
solution (180, 649) of $13x^2 + 1 = y^2$ by Theorem 2. Now taking the composite
of (10, 40; 300) with (180, 649; 1), one gets the larger solution (13690, 49360) of
$13x^2 + 300 = y^2$; further compositions will yield still larger solutions. Similarly,
for $13x^2 - 27 = y^2$, one considers $(6, 21; -27) \odot (180, 649; 1)$, $(6, 21; -27) \odot
(180, 649; 1) \odot (180, 649; 1)$, and so on.
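Repeated composition with a fixed solution of $13x^2 + 1 = y^2$ is a one-line loop. A minimal Python sketch (the generator name `larger_roots` is ours) reproduces the values in the paragraph above:

```python
def samasa(t1, t2, D):
    """Samasa-bhavana of two triples (x, y, m) satisfying D*x^2 + m = y^2."""
    (x1, y1, m1), (x2, y2, m2) = t1, t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)

def larger_roots(start, unit, D, count):
    """Yield `count` successive composites of `start` with `unit` (a triple with m = 1)."""
    t = start
    for _ in range(count):
        t = samasa(t, unit, D)
        yield t

unit = (180, 649, 1)                       # 13*180^2 + 1 = 649^2
for x, y, m in larger_roots((10, 40, 300), unit, 13, 3):
    print(x, y, 13 * x * x + m == y * y)
for x, y, m in larger_roots((6, 21, -27), unit, 13, 3):
    print(x, y, 13 * x * x + m == y * y)
```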

In section 1, we mentioned that it is now known that all positive integer solutions
of $Dx^2 + 1 = y^2$ are given by $(x_n, y_n)$ satisfying $y_n + \sqrt{D}\,x_n = (y_1 + \sqrt{D}\,x_1)^n$,
where $(x_1, y_1)$ is the fundamental solution (i.e., the minimal positive integral solution).
Once the fundamental solution is determined, one can see that the other
solutions $(x_n, y_n)$ are given by Theorem 1. The sequence $\{y_n/x_n\}_{n \ge 1}$ forms a
sequence of “best” rational approximations to $\sqrt{D}$ (the precise meaning will be stated
in section 7).


⁴⁶ This gives another strategy for solving Example 1: it is enough to solve $92x^2 + 4 = y^2$ and hence
$23x^2 + 1 = y^2$; use $23 + 2 = 5^2$.


Some Perspectives on the Bhavana 


We now recall a few points that are further elaborated in the author's article [Du1].


(i) Brahmagupta's very approach to the problem anticipates a modern trend
(which perhaps began with Lagrange): the quest for general principles, the attempt 
to view a problem as part of a larger set-up. While trying to solve a specific hard 
problem Dx? + 1 = y? (in two variables), Brahmagupta undertook a bold and 
farsighted exploration of the general picture: the solution space of Dx? + m = y? 
(in three variables). In the process Brahmagupta discovers and extracts an important 
general and abstract principle (Theorem 1), and makes a clear enunciation of this 
principle. 

(ii) Brahmagupta envisages the key ingredient of a modern abstract structure:
binary composition. If we exclude the four elementary arithmetic operations, the
bhavana is perhaps the first conscious construction of a binary composition. Note
that the binary operation is quite a complicated one: it involves two integral triples
of unknown roots.⁴⁷ Thus, in an attempt to solve an indeterminate equation in two
variables, a seventh-century mathematician thought of constructing what amounts
to an intricate abstract structure on the solution space of an equation in three
variables.⁴⁸ One does not see such an approach in mathematics during the next 1000
years. The discovery of an algebraic structure, such as Brahmagupta's composition
law, on a set of significance, is now an important theme in modern mathematics
research.


(iii) The power of the general composition principle of Brahmagupta can be seen
from the ease with which it immediately provides the solution of a difficult equation
like $92x^2 + 1 = y^2$. Ironically, this very simplicity tends to cloud the historian’s
perception of the true worth of Theorem 1. A casual observer may miss the richness
and magnificence of ideas encapsulated in the 2-step solution of $92x^2 + 1 = y^2$.
Further, once the mathematical community gets accustomed to an original idea
like the bhavana, it becomes all the more difficult to fathom the greatness of the
discovery. Historians need to be aware of this intrinsic risk of missing the real depth
and significance of Brahmagupta’s work on the varga-prakrti.


⁴⁷ Recall that even basic symbolic computations with unknown roots, treating them as if they
were known quantities, is a fairly modern approach (from late 18th century) that emerged during the
investigations on the general polynomial in one unknown.

⁴⁸ The idea occurs in a fluid amorphous form; it had not been crystallised in a precise set-theoretic
framework: the solution space was not presented as a single set.
(iv) The bhavana influenced, directly and indirectly, subsequent research on
indeterminate equations by Indian algebraists. Apart from providing partial solutions
to the original problem, the samasa-bhavana contained the key to the subsequent
complete solution by the cakravala algorithm as we will see in the next section.


(v) Sometimes the brilliance of a particular algebraic research work lies in its
opening up of new and unexpected horizons with immense possibilities through
surprisingly simple innovations. Let us view the samasa-bhavana in this light. Not
only did Theorem 1 influence the complete solution to the original problem, it had
opened up new possibilities whose true potential began to be harnessed only after
its rediscovery in the 18th century. Theorem 1 turned out to be “a theorem of capital
importance” (in the words of Euler⁴⁹). For instance, the samasa-bhavana $\odot$ has an
elegant interpretation in terms of the norm function, a very important concept
in modern mathematics. Let $A = \mathbb{Z}[\sqrt{D}] = \{b + a\sqrt{D} \mid a, b \in \mathbb{Z}\}$, where $\mathbb{Z}$
denotes the set of integers. The norm function on $A$ is the map $N : A \to \mathbb{Z}$ defined
by


$N(y + x\sqrt{D}) = (y + x\sqrt{D})(y - x\sqrt{D}) = y^2 - Dx^2$.

Brahmagupta’s identity may be reformulated as the statement:

The norm function $N$ is multiplicative, i.e., $N(\alpha\beta) = N(\alpha)N(\beta)$ $\forall \alpha, \beta \in A$.


Now the solution space $S := \{(x, y, m) \in \mathbb{Z} \times \mathbb{Z} \times \mathbb{Z} \mid Dx^2 + m = y^2\}$ is precisely the
graph of $N$ and we have a bijection $f : S \to A$ defined by $f(x, y, m) = y + x\sqrt{D}$.
Transferring the ring multiplication on $A$ to $S$ via $f^{-1}$, we get the samasa-bhavana.
Thus the exotic multiplicative structure $\odot$ on $S$ actually corresponds to the natural
ring multiplication on $A$.
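For the reader who wishes to see the correspondence explicitly, the computation can be written out as follows (a routine verification in the notation above, with $m_i = y_i^2 - Dx_i^2$):

```latex
\begin{align*}
(y_1 + x_1\sqrt{D})(y_2 + x_2\sqrt{D})
  &= (Dx_1x_2 + y_1y_2) + (x_1y_2 + x_2y_1)\sqrt{D},\\
N\bigl((y_1 + x_1\sqrt{D})(y_2 + x_2\sqrt{D})\bigr)
  &= (Dx_1x_2 + y_1y_2)^2 - D(x_1y_2 + x_2y_1)^2\\
  &= (y_1^2 - Dx_1^2)(y_2^2 - Dx_2^2) = m_1 m_2 .
\end{align*}
```

The coefficient of $\sqrt{D}$ and the rational part of the product are exactly the lesser and greater roots of the composite triple, and the norm of the product is its interpolator.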

It is tempting to surmise that Brahmagupta discovered Theorem 1 through an
algebraic manipulation which was, in essence, the verification of the structure
preserving property of the norm function, with the natural multiplication in $A$ providing
the precise formula (see [Du1, p 99-101]).


(vi) In the language of quadratic forms, Theorem 1 says that two binary quadratic
forms with a given discriminant $D$ (say $y^2 - Dx^2$, $v^2 - Du^2$) can be composed⁵⁰
to yield another such form with discriminant $D$ in a new pair of variables
$(xv + yu, yv + Dxu)$. Theorem 1 can also be viewed as the result (see [Du1, p 83-84] for
details):


The binary form $y^2 - Dx^2$ is strongly multiplicative (over rationals).


This version of Theorem 1 was generalised in 1965 by A. Pfister using what are now
called “Pfister forms”. Pfister’s discovery opened up new directions in the theory
of quadratic forms. Theorem 1 thus happens to be a starting point of a distinct area
of research in algebra. In his 1990 research monograph [Oj] in this field, Manuel
Ojanguren begins Chapter 5 by quoting Brahmagupta's original Sanskrit verses
describing Theorem 1; the chapter itself is titled Also sprach Brahmagupta⁵¹.


⁴⁹ Euler highlighted the result as theorema eximium; see ([We2, p 204-205]).

⁵⁰ The theory of composition of quadratic forms, an important and rich topic initiated by Gauss
and Dirichlet, is still an active area of research. In recent years, Manjul Bhargava discovered higher
composition laws, a work which has many important ramifications. An account is given in Bhargava's
article Higher composition laws and applications in the Proceedings of International Congress of
Mathematicians (ICM), Madrid (2006).


(vii) While it is exciting to dwell on the implicit occurrence of modern principles
in an ancient text, let us not lose sight of one aspect of the section Varga-prakrti: the
sheer wizardry in classical algebraic manipulation at an early stage in the history of
symbolic algebra.


(viii) As argued in section 3 of our earlier paper ([Du1]), Brahmagupta's treatment
of varga-prakrti has a unique pedagogic potential. Theorem 1 defines an
intricate binary operation $\odot$ on $S = \{(x, y, m) \in \mathbb{Z} \times \mathbb{Z} \times \mathbb{Z} \mid Dx^2 + m = y^2\}$.
This sophisticated idea of constructing a binary composition on an abstractly defined
unknown set is the quintessence of modern “abstract algebra”. Theorem 1
and Example 1 can be effectively used to convey to a student the power of the concept
“binary composition” and help him appreciate the essence of abstract algebraic
ideas. It could be inspiring for the student to realise that the simplicity of the solution
to Example 1 has its secret in the monoid structure of $(S, \odot)$ which in turn has its
root in the natural ring structure of the set $A$ (defined above in (v)). As we mentioned
in [Du1, p 88], “the unexpected invocation of an abstract algebraic technique for
a concrete problem by an ancient mathematician could infuse in him [a student] a
dynamic and creative vigour in his formal study of abstract algebra. Thus, through
this deep but easily accessible work of a Master, the perceptive student can get
an early exposure to modern sophistication, see a natural application of abstract
algebraic principles, develop a flair for exploring worthwhile generalisation, and
truly imbibe the spirit of the abstract algebra culture without getting swept away by
its formalism.”


(ix) Due to the paucity of adequate historical materials, it is often difficult to
ascertain whether a particular result in a treatise was discovered by the author or by
a predecessor. One does not know who was the true discoverer of the cakravala;
one cannot even be sure whether Aryabhata was the pioneer of kuttaka. But the
bhavana has the unmistakable stamp of Brahmagupta’s genius. One does not see
any seed of this idea in the treatises of his predecessors Aryabhata and Bhaskara
I or in the Bakhshali manuscript. None of the results in these remarkable texts come
close to the algebraic sophistication shown by Brahmagupta's research on varga-prakrti.
In several other results of Brahmagupta, one discerns a similar abrupt leap
in mathematical maturity.


⁵¹ In English: Thus Spake Brahmagupta.


7. Cakravala 


In the Laghu-Bhaskariya (Ch 8; verse 18) of Bhaskara I, there is a problem from
astronomy involving the simultaneous quadratic indeterminate equations $8x + 1 = y^2$
and $7y^2 + 1 = z^2$ ([Shu3, p 101]). By inspection, one can quickly arrive at the
integral solution $y = 3$, $z = 8$ for the second equation. In the context of this special
equation, Udaydivakara, in his commentary on the Laghu-Bhaskariya, refers to
Jayadeva's general cakravala method for solving any equation $Dx^2 + 1 = y^2$.
Jayadeva's verses on the cakravala algorithm have been published in ([Shu1]); a
portion has been reproduced in the Appendix. Bhaskara II’s verses on this algorithm
occur in Bijaganita ([Ab, p 23]; [P, p 76]); they have been quoted in M.S. Sriram’s
article [Srr, p 167].


The Algorithm 


Recall that $(p, q; m)$ denotes a triple satisfying $Dp^2 + m = q^2$. The underlying
idea of the algorithm can then be put as follows: from an integer triple $(p_n, q_n; m_n)$
such that $|m_n|$ is “small”, one has to construct an integer triple $(p_{n+1}, q_{n+1}; m_{n+1})$
with $|m_{n+1}|$ “small”, eventually arriving at an integer triple of the form $(p, q; 1)$. It
will be convenient for our discussions to assume that, at each stage, the integers
$p_n$ and $m_n$ are mutually coprime. Observe that, if, in an integer triple $(p, q; m)$, the
integers $p$ and $m$ have a GCD $g$, then the relation $Dp^2 + m = q^2$ will show that $g$ is
a factor of $q$ and $g^2$ a factor of $m$ and hence $(\frac{p}{g}, \frac{q}{g}; \frac{m}{g^2})$ will also be an integer triple.
Thus, replacing $(p, q; m)$ by $(\frac{p}{g}, \frac{q}{g}; \frac{m}{g^2})$ if necessary, we may ensure that $p$ and $m$
are relatively prime.

Integer triples of the type $(1, y; y^2 - D)$ were “in the air” among Indian
algebraists who expounded on Brahmagupta’s work.⁵² The authors of cakravala
invariably begin with an initial triple $(p_0, q_0; m_0)$ of this type: $p_0 = 1$, $q_0$ a positive
integer for which $|q_0^2 - D|$ is minimum, and $m_0 = q_0^2 - D$.


⁵² Sripati (and successive algebraists) clearly explains, what is tersely indicated by Brahmagupta,
that applying bhavana on two copies of any triple $(1, q; q^2 - D)$, one gets the rational solution
$x = \frac{2q}{q^2 - D}$, $y = \frac{q^2 + D}{q^2 - D}$ of $Dx^2 + 1 = y^2$ and that infinitely many roots of this equation can be obtained
by varying $q$ and also by repeating the bhavana ([Sih, p 40]; [DS2, p 153:150]). The possible impact of
this observation on the discovery of the cakravala is discussed in [Du1, p 90-92; 96-97].
We now explain the inductive step. Suppose that one has completed $n$ steps and
arrived at an integer triple $(p_n, q_n; m_n)$ with $p_n$ and $m_n$ coprime. Composing this
(known) integer triple $(p_n, q_n; m_n)$ with the (variable) integer triple $(1, y; y^2 - D)$
by samasa-bhavana, one gets an integer triple $(p_n y + q_n, Dp_n + q_n y; m_n(y^2 - D))$,
that is, the identity


$D(p_n y + q_n)^2 + m_n(y^2 - D) = (Dp_n + q_n y)^2$.


Dividing the above identity by $m_n^2$ (an idea which too could have arisen from the
methods for obtaining rational solutions), one obtains a rational triple


$(p_{n+1}, q_{n+1}; m_{n+1}) = \left(\frac{p_n y + q_n}{m_n}, \frac{Dp_n + q_n y}{m_n}; \frac{y^2 - D}{m_n}\right)$.


Now the identity $q_n^2 = Dp_n^2 + m_n$ shows that


$m_n p_{n+1} q_n = (p_n y + q_n) q_n = p_n(Dp_n + q_n y) + m_n$.


Suppose that $p_{n+1}$ is an integer. As $p_n$ and $m_n$ are coprime, it would then follow
that $m_n$ divides $Dp_n + q_n y$, that is, $q_{n+1}$ is an integer. It also follows that
$m_{n+1}$ ($= q_{n+1}^2 - Dp_{n+1}^2$) is an integer. Further, one has the relation $p_{n+1} q_n - p_n q_{n+1} = 1$
which shows, by the argument made at the outset, that $p_{n+1}$ and $m_{n+1}$ are
coprime.


[...] (and hence $q_{n+1}$, $m_{n+1}$) becomes an integer. This amounts to finding integer
solutions of the equation $m_n x - p_n y = q_n$. As we saw in section 5, the linear
indeterminate equation had been extensively discussed by Indian algebraists from
[...] Aryabhata (499 CE) and they knew the complete solution to the problem.
[...] infinitely many solutions, Jayadeva and Bhaskara II choose a [...]


[...] process terminates after a finite number of steps (that [...]
proof does not occur in any ancient text. Note [...]
$m_n \in \{\pm 1, \pm 2, \pm 4\}$; for, in that [...]
formulae (Theorem 2) to arrive at a solution of [...]


[...] assert that one will always arrive at such a [...]
An Example


Quite often Brahmagupta’s result gives a very effective short-cut for numerical
computations as the solution of the famous example of Bhaskara II ([Ab, p 24]; [P,
p 77]) shows:


Example 2 (Bhaskara II). Solve, in positive integers, $61x^2 + 1 = y^2$.
Solution. As 64 is the perfect square nearest to 61, we have the initial triple (1, 8; 3).
Now one finds a positive integer $y$ for which $\frac{y + 8}{3}$ is an integer and $|y^2 - 61|$ is
minimised. Clearly $y = 7$. Now


$\frac{7 + 8}{3} = 5$, $\frac{61 + 8 \times 7}{3} = 39$, $\frac{7^2 - 61}{3} = -4$.


Thus we have the second triple $(5, 39; -4)$. Now rather than continuing the
cakravala, one can apply Brahmagupta’s formula (Theorem 2) on the triple
$(5, 39; -4)$ to immediately get the minimum positive integral solution (226153980,
1766319049) of $61x^2 + 1 = y^2$.

The article ([OR]) works out solutions to the examples of Narayana using the
cakravala aided by the bhavana. The minimum positive integral solution for
$103x^2 + 1 = y^2$ is (22419, 227528) and for $97x^2 + 1 = y^2$ is (6377352, 62809633).
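The whole cycle can be run mechanically. The Python sketch below follows the inductive step described above, with two simplifications that are ours and not the ancient authors': the kuttaka step is replaced by a direct search for $y$ among the first few positive integers, and the process is continued until $m = 1$ without the Theorem 2 short-cut. The function name `cakravala` and the handling of signs (absolute values of the roots) are likewise assumptions of this illustration.

```python
from math import isqrt

def cakravala(D):
    """Return (x, y) with D*x^2 + 1 = y^2, for a positive non-square integer D."""
    root = isqrt(D)
    if root * root == D:
        raise ValueError("D must not be a perfect square")
    # Initial triple (1, q0; q0^2 - D) with |q0^2 - D| as small as possible.
    q = root if D - root * root <= (root + 1) ** 2 - D else root + 1
    p, m = 1, q * q - D
    while m != 1:
        # All y in a window around sqrt(D) with m | (p*y + q); search replaces the kuttaka.
        candidates = [y for y in range(1, root + abs(m) + 2) if (p * y + q) % m == 0]
        y = min(candidates, key=lambda t: abs(t * t - D))
        # Compose with (1, y; y^2 - D), then divide the roots by m and the interpolator by m^2.
        p, q, m = (abs(p * y + q) // abs(m),
                   abs(D * p + q * y) // abs(m),
                   (y * y - D) // m)
    return p, q

for D in (61, 92, 97, 103):
    x, y = cakravala(D)
    print(D, x, y, D * x * x + 1 == y * y)
```

For $D = 61$ the loop passes through the triples (1, 8; 3) and $(5, 39; -4)$ of Example 2 and, after fourteen steps in all, reaches (226153980, 1766319049).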


Application to Rational Approximation 


Narayana applies the cakravala-bhavana method of generating arbitrarily large
solutions of $Dx^2 + 1 = y^2$ to determine progressively better rational approximations
of $\sqrt{D}$ ([D1]; [Du1, p 93; 102-3]). Note that, if $Da^2 + 1 = b^2$, then


$\frac{b}{a} - \sqrt{D} = \frac{b^2 - Da^2}{a(b + a\sqrt{D})} = \frac{1}{a(b + a\sqrt{D})}$


and thus, for a sufficiently large solution $(a, b)$, $\frac{b}{a}$ will be a good approximation for
$\sqrt{D}$. Narayana cites two numerical examples: $\sqrt{10}$ and $\sqrt{1/5}$. We discuss the former;
the case of $\sqrt{1/5}$ is similar.


For $\sqrt{10}$, Narayana mentions the rational approximations $\frac{19}{6}$, $\frac{721}{228}$ and $\frac{27379}{8658}$. To
see how the fractions arise, consider the notation of section 6 for $D = 10$. Since 9
is the perfect square nearest to 10, one has the initial triple $(1, 3; -1)$. Now,

$(1, 3; -1) \odot (1, 3; -1) = (6, 19; 1)$,
$(6, 19; 1) \odot (6, 19; 1) = (228, 721; 1)$,
$(6, 19; 1) \odot (228, 721; 1) = (8658, 27379; 1)$.
Thus one has the three successive fractions $\frac{19}{6}$, $\frac{721}{228}$ and $\frac{27379}{8658}$ as rational approximations
to $\sqrt{10}$. The usefulness of the method can be seen from the fact that the third
fraction $\frac{27379}{8658}$ (= 3.162277662...) matches the value of $\sqrt{10}$ (= 3.162277660...)
up to nine decimal digits. This method of generating successively closer approximations
was restated by Euler in 1732 ([D1, p 188]). It is now known that if
$\{(x_n, y_n) \mid n \ge 1\}$ is the sequence of positive integral solutions of $Dx^2 + 1 = y^2$,
then, among all fractions with denominator $x_n$ or less, $\frac{y_n}{x_n}$ is the best approximation
to $\sqrt{D}$. This follows from a basic theorem in continued fractions ([BrC, p 399-401;
535]; [G, p 63-64; 71-72]). In particular, Narayana’s approximations for $\sqrt{10}$ are
the best possible (with the respective restrictions on the denominators). One can
similarly see that the Sulba approximations for $\sqrt{2}$, mentioned in section 1, are
again the best possible. For the equation $2x^2 + 1 = y^2$, the minimum positive
integral solution is clearly (2, 3). Now the bhavana relations


$(2, 3; 1) \odot (2, 3; 1) = (12, 17; 1)$ and $(12, 17; 1) \odot (12, 17; 1) = (408, 577; 1)$


yield the two Sulba fractions $\frac{17}{12}$ and $\frac{577}{408}$. It can also be seen, directly or from the
theory of continued fractions, that the Sulba fraction $\frac{7}{5}$ is the best approximation to
$\sqrt{2}$ among all fractions with denominator $\le 5$.
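The approximations above, and the “best possible” claims for small denominators, can be checked directly. The Python sketch below is an illustration only: `best_fraction` is a brute-force search of our own (using floating-point comparison, which is adequate for denominators this small), not a method found in the texts.

```python
from fractions import Fraction
from math import sqrt

def samasa(t1, t2, D):
    """Samasa-bhavana of two triples (x, y, m) satisfying D*x^2 + m = y^2."""
    (x1, y1, m1), (x2, y2, m2) = t1, t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)

def best_fraction(D, max_den):
    """Fraction closest to sqrt(D) among all fractions with denominator <= max_den."""
    root = sqrt(D)
    candidates = (Fraction(round(k * root), k) for k in range(1, max_den + 1))
    return min(candidates, key=lambda f: abs(float(f) - root))

t1 = samasa((1, 3, -1), (1, 3, -1), 10)     # (6, 19; 1)
t2 = samasa(t1, t1, 10)                     # (228, 721; 1)
t3 = samasa(t1, t2, 10)                     # (8658, 27379; 1)
for x, y, _ in (t1, t2, t3):
    print(f"{y}/{x} = {y / x:.9f}   sqrt(10) = {sqrt(10):.9f}")

print(best_fraction(10, 6), best_fraction(2, 5), best_fraction(2, 12))
```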


Some Perspectives on the Cakravála 


We now make a few observations. 


(i) The cakravala algorithm demonstrates a marvellous interplay between the
two great preceding works: the kuttaka and the bhavana. As Weil observes ([We2,
p 22]):


As is the case with many brilliant discoveries, this one [cakravala] can be seen in retrospect
as deriving quite naturally from the earlier work [samasa-bhavana].


[...] pedagogic potential of this development can hardly be overstated. A student can
[...] example⁵³ of how first-rate research flows: how Aryabhata
[...] algorithm for computing the GCD to make a “descent” into [...]


[...] Brahmagupta approaches the harder problem of
[...] brilliant bhavana yielding a partial [...]


[...] predecessor) assimilates both the [...]
(ii) Expositions on the cakravala often tend to identify it with the Brouncker-Euler-Lagrange
procedure. But there is a difference. In the notation used above, the
Brouncker-Euler-Lagrange method would amount to choosing quantities $y < \sqrt{D}$
for which $D - y^2$ is minimised (subject to conditions above), while the cakravala
of Jayadeva and Bhaskara II stipulates that $|D - y^2|$ is minimised.⁵⁴ The steps in the
former method correspond to certain convergents of the standard continued fraction
expansion of $\sqrt{D}$; the steps in the latter correspond to a generalised expansion
where negative remainders could be involved. For instance, for the case $D = 61$
($= 7^2 + 12 = 8^2 - 3$), the starting point in the former procedure will be 7, while in the
latter it will be 8.

The cakravala can also be interpreted in the framework of the regular (i.e., the
usual) continued fraction expansion of $\sqrt{D}$ (see [Sig]); its steps correspond to fewer
convergents of the expansion than the steps of Brouncker's method. For instance,
the first three terms of the continued fraction expansion of $\sqrt{61}$ are 7, $\frac{8}{1}$ and $\frac{39}{5}$
(with $7 < \frac{39}{5} < \sqrt{61} < \frac{8}{1}$); the cakravala involves only $\frac{8}{1}$ and $\frac{39}{5}$. As discussed
in section 5, there are indications that ancient Indian mathematicians had possibly
discovered principles of continued fractions. The term cakravala (cyclic) could be
an allusion to the periodicity of a continued fraction expansion of $\sqrt{D}$ (see [Srl, p
286]) or some equivalent concept.
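The comparison with the regular continued fraction can be made concrete. The Python sketch below computes convergents of $\sqrt{D}$ by the standard recurrence for quadratic surds (a modern textbook procedure; the function name `sqrt_convergents` is ours, and $D$ is assumed not to be a perfect square) and checks that the first cakravala quotients for $D = 61$ occur among them.

```python
from math import isqrt

def sqrt_convergents(D, count):
    """First `count` convergents (h, k) of the regular continued fraction of sqrt(D)."""
    a0 = isqrt(D)
    m, d, a = 0, 1, a0                      # current complete quotient is (sqrt(D) + m) / d
    h_prev, k_prev, h, k = 1, 0, a0, 1
    convergents = [(h, k)]
    while len(convergents) < count:
        m = d * a - m
        d = (D - m * m) // d
        a = (a0 + m) // d                   # next partial quotient
        h_prev, k_prev, h, k = h, k, a * h + h_prev, a * k + k_prev
        convergents.append((h, k))
    return convergents

regular = sqrt_convergents(61, 8)
print(regular[:3])                          # [(7, 1), (8, 1), (39, 5)]
cakravala_steps = [(8, 1), (39, 5), (164, 21), (453, 58), (1523, 195)]
print(all(step in regular for step in cakravala_steps))   # True
```

The convergents 7/1, 125/16 and 1070/137 appear in the regular expansion but are skipped by the cakravala, which is the sense in which its steps correspond to fewer convergents.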

(iii) The descriptions of cakravala by Jayadeva and Bhaskara II begin with
the inductive step of constructing $(p_{n+1}, q_{n+1}; m_{n+1})$ from $(p_n, q_n; m_n)$ without
mentioning any starting point $(p_0, q_0; m_0)$. Possibly the authors of cakravala were
flexible regarding the choice of the initial triple $(p_0, q_0; m_0)$; any three convenient
integers $p, q, m$ satisfying $Dp^2 + m = q^2$ could be the starting point.⁵⁵ They
of course had an obvious choice for the initial triple, that is, a triple of the form
$(1, q; q^2 - D)$ which had already become common in the study of varga-prakrti. In
the illustrative examples, they invariably display the triple $(1, q_0; q_0^2 - D)$ where
$q_0$ is an integer $y$ for which $|y^2 - D|$ is minimum. As triples of the form
$(1, y; y^2 - D)$ were highlighted, in any case, as a central tool in the cakravala verses, a
separate mention of $(1, q_0; q_0^2 - D)$ was perhaps felt unnecessary.⁵⁶ Incidentally
the triple $(1, q; q^2 - D)$ finds explicit mention in Jayadeva's verse on the solution
of $Dx^2 + m = y^2$ occurring immediately after his verses on $Dx^2 + 1 = y^2$
([Shu1, p 14]).

Perhaps some of the writers of books on History of Mathematics, who erro- 
neously state that the cakravála involves “trial-and-error”, get confused by the 


54 Nārāyaṇa appears to differ slightly on this point. While he is silent on the minimisation in his
description of the cakravāla, in his examples, he chooses $y$ for which $|y - \sqrt{D}|$ is minimised; see [Sig].

55 Recall that in the kuttaka too there was a flexibility of stopping at some stage k (rather than
completing the long division for computation of the GCD) if one could see, by inspection, the solution
of some intermediary linear equation.

56 Recall the culture of brevity in ancient Indian treatises discussed at the beginning of section 5.
apparent obscurity (in the translations of the verses) regarding the initial triple.
They do not realize that triples $(1, q; q^2 - D)$ were taken for granted by the ancient
authors of cakravāla.⁵⁷


(iv) As observed by A.A. Krishnaswamy Ayyangar ([Kr1, p 234]), in retrospect,
the original cakravāla can be simplified by avoiding the kuttaka. Denote by $y_{n+1}$
the integer $y$ which satisfies $m_n p_{n+1} - p_n y_{n+1} = q_n$. Thus

$$p_n = \frac{p_{n-1} y_n + q_{n-1}}{m_{n-1}}, \qquad q_n = \frac{D p_{n-1} + q_{n-1} y_n}{m_{n-1}}, \qquad m_n = \frac{y_n^2 - D}{m_{n-1}},$$

so that $p_n y_n - q_n = p_{n-1} m_n$ and hence

$$p_{n+1} = \frac{p_n y_{n+1} + q_n}{m_n} = \frac{p_n (y_{n+1} + y_n)}{m_n} - p_{n-1};$$

in particular, $y_{n+1} + y_n \equiv 0 \pmod{m_n}$. Hence, instead of finding $p_{n+1}$ by kuttaka,
one can construct $y_{n+1}$ inductively as follows: start with $p_0 = 1$, $y_0 = q_0$ a
positive integer $y$ for which $|y^2 - D|$ is minimum, and $m_0 = y_0^2 - D$; having
constructed $y_n, p_n, q_n, m_n$ with $p_n, q_n, m_n$ being mutually coprime, choose $y_{n+1}$ to
be a positive integer $y$ satisfying the simple congruence $y \equiv -y_n \pmod{m_n}$ for
which $|y^2 - D|$ is minimal. Then $p_{n+1} := \frac{p_n (y_{n+1} + y_n)}{m_n} - p_{n-1}$ is an integer; and, as
before, the corresponding $q_{n+1}, m_{n+1}$ will also be integers with $p_{n+1}, q_{n+1}, m_{n+1}$
being mutually coprime.
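
The congruence-based procedure just described can be sketched in a few lines of code. This is an illustrative sketch under stated assumptions rather than a rendering of any original text: the function name `cakravala` is hypothetical, the divisions use $|m_n|$ instead of the signed $m_n$ (a common normalisation that keeps $p_n$ and $q_n$ positive and affects only signs), and the loop simply stops the first time $m_n = 1$.

```python
from math import isqrt


def cakravala(D):
    """Return positive integers (p, q) with D*p*p + 1 == q*q,
    for a non-square positive integer D.
    (Hypothetical helper; divides by |m| to keep p, q positive.)"""
    r = isqrt(D)
    if r * r == D:
        raise ValueError("D must not be a perfect square")
    # Initial triple (1, q0; q0^2 - D), with q0 minimising |q0^2 - D|.
    q = r if D - r * r <= (r + 1) ** 2 - D else r + 1
    p, y, m = 1, q, q * q - D
    while m != 1:
        k = abs(m)
        residue = (-y) % k                 # need y_new ≡ -y (mod |m|)
        below = r - ((r - residue) % k)    # largest candidate <= floor(sqrt(D))
        above = below + k                  # smallest candidate > floor(sqrt(D))
        candidates = [c for c in (below, above) if c > 0]
        y_new = min(candidates, key=lambda c: abs(c * c - D))
        # Both quotients are exact; the right-hand side uses the old p and q.
        p, q = (p * y_new + q) // k, (D * p + q * y_new) // k
        m = (y_new * y_new - D) // m
        y = y_new
    return p, q


print(cakravala(61))  # (226153980, 1766319049)
```

For D = 61 the successive values of $m_n$ run 3, -4, -5, 5, 4, -3, -1, ... before returning to 1; only the two candidates nearest to $\sqrt{D}$ in the required residue class need to be compared, since $|y^2 - D|$ grows as $y$ moves away from $\sqrt{D}$.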
Simplifications are observed after initial breakthroughs. But, as mentioned
earlier, there is a certain richness of thought in the ingenious approach through the
[...] bhāvanā which would be good for students to imbibe. For
[...]ating new solutions out of given [...]
Indeed, Indians had a fascination for large numbers from the Vedic times.⁵⁹ The
facility with large numbers was conducive for the Indians in their research on the
Pellian equation, which soon involves large solutions.


(vii) The early occurrence of Pell’s equation in Indian history can also be seen 
in the light of an observation of Legendre ([G, p 3]): 


... it appears that the ancient philosophers⁶⁰ did rather extensive research on the properties
of numbers. But they lacked two tools needed to reach the depths of this science: the art of 
numeration, which serves to express numbers with great ease, and algebra, which generalizes 
results and can operate equally on the known and the unknown. The invention of both these 
arts greatly influenced the progress of the science of numbers. 


Greeks had generally neglected computational mathematics and it is significant 
that Archimedes, who had suggested a problem which was later seen to involve 
Pell’s equation, also happened to be the one Greek mathematician who had re- 
alised the importance of an efficient system of enumeration and had tried to evolve 
one. 

While the “art of numeration” had been perfected in India well before the 
time of Aryabhata, the Indian algebraists from Brahmagupta onwards had also 
become adept in the art of formation and clearance of equations. Brahmagupta 
used the term samīkaraṇa (making equal) for equation, varṇa (colour-names) for
unknowns and rūpa (appearance) for coefficients, described a plan for writing
equations and then preparing them, and classified equations. Further details are given
in [DS2]. 

(viii) Perhaps it can never be ascertained whether ancient Indian algebraists had 
conceived of a proof⁶¹ that the cakravāla method always works. Even if they had a
proof, given the technicalities involved, it would have been difficult to express it in 
those days. After all, even Fermat had not written (or is not known to have written) a 
proof, although he claimed that he had a proof by “descent”. While analyzing what 
might have been Fermat's proof, Weil provides a valuable insight into Fermat's 
possible difficulty in formulating it ([We2, p 99]): 


It does not seem unreasonable to assume that this may have been, in general outline, what 
Fermat had in mind when he spoke of his proofs in his letters to Pascal and Huygens; how 
much of it he could have made explicit must remain a moot question. Perhaps one of his 


59 A verse of Medhātithi in the Vājasaneyī Saṃhitā (XVII.2) of the Śukla Yajurveda gives number-names
for each power of ten up to $10^{12}$; much larger numbers are given in the epic Rāmāyaṇa and the
Jaina and Buddhist literature. For a few examples, see R.C. Gupta, World's longest lists of decuple terms,
Gaṇita Bhāratī Vol 23 (2001), p 83-90.

60 Legendre was referring to ancient Greeks; he was not aware of ancient Indian research in number
theory.

61 Ancient Indians had the notion of upapatti which corresponds to "proof"; see the article by M.D.
Srinivas ([Srn]) for a comparison of the two concepts.
most serious handicaps was the lack of the subscript notation; this was introduced later by
Leibniz, in still imperfect form, and its use did not become general until rather late in the 
next century. Faced with a difficult or complicated proof, Fermat might well be content with 
a careful analysis of some typical numerical cases, convincing himself at the same time that 
the steps involved were of general validity. Actually he never succeeded in writing down his 
proofs. 


Now the handicaps of Fermat applied all the more to Jayadeva, Bhaskara II and
their predecessors who had no better writing-aid than perishable palm-leaves. Also
note that the theoretical justification for cakravāla is more complicated. Besides,
as mentioned at the beginning of section 5, the great Ācāryas (Masters) in ancient
Indian tradition usually stated only the essence of their results (and that too in verse
form). Details were left for oral transmission; but the unfortunate break in oral
tradition, brought about by foreign invasions, resulted in the loss of significant links
in mathematical and scientific knowledge. But the very nature of the cakravāla (and
the term itself) suggests that, like Euler, the Indians must have had a deep insight
into the problem if not a complete proof.


The history of ancient Indian algebra, especially that of the kuttaka, the bhāvanā
and the cakravāla, is a grand illustration of the following remark of Sri Aurobindo
([A, p 185]):


Especially in mathematics, astronomy and chemistry, the chief elements of ancient science, 
she [India] discovered and formulated much and well and anticipated by force of reasoning 
or experiment some of the scientific ideas and discoveries which Europe first arrived at much 
later, but was able to base more firmly by her new and completer method. 


8. Brahmagupta as a representative of The Classical Age 


The kuttaka, the bhāvanā and the cakravāla are fruits of the Classical Age⁶² in Indian
history, a period of intellectual flourish, a period of a magnificent efflorescence in
myriad fields of human endeavour. To a thinker who is aware of the spirit of the
times, the Indian achievements on algebra in general, and quadratic indeterminate
equations in particular, do not come as a surprise.⁶³ Thus a familiarity with the broad
features of the cultural history of ancient India is desirable for fresh thinking on any
topic in ancient Indian science. A deep insight into the history can be had from Sri
Aurobindo's essays in [A].


62 The "Classical Age" usually refers to the Gupta period (beginning from 320 CE), the period often
identified as the "Golden Age" of post-Vedic Indian history. Many features of the Classical Age can be
seen in the history of the preceding centuries, from the time of Mauryas (from around 300 BCE) at least.

63 Unfortunately, there have been scholars from both India and the West who have nurtured the image
of the ancient Indian mind as being exclusively other-worldly.
The early epoch of the Veda and the Upanisads, luminous with the discovery of
the Spirit, was the "heroic creative seed-time" ([A, p 110]), the first formative
period of the deeper and finer aspects of the Indian civilisation. This inspired period 
of great spiritual outflowering passed away into an age of strong intellectuality. In 
the words of Sri Aurobindo ([A, p 148]): 


The second or post-Vedic age of Indian civilisation was distinguished by the rise of 
the great philosophies, by a copious, vivid, many-thoughted, many-sided epic literature, 
by the beginnings of art and science, by the evolution of vigorous and complex society, by 
the formation of large kingdoms and empires, by manifold formative activities of all kinds 
and great systems of living and thinking. Here as elsewhere, in Greece, Rome, Persia, China, 
this was the age of a high outburst of the intelligence working upon life and the things of the 
mind... 


It was a birth time and youth of the seeking intellect. . . . 


In the Classical Age, each department of knowledge or activity, once taken up, 
was pursued to its extreme with an amazing intensity, a powerful, penetrating and 
scrupulous intelligence. Sri Aurobindo explains ([A, p 402-403]):


For the third power of the ancient Indian spirit was a strong intellectuality, at once austere 
and rich, robust and minute, powerful and delicate, massive in principle and curious in 
detail. ... 


The mere mass of the intellectual production during the period from Asoka well into the 
Mahomedan epoch is something truly prodigious, as can be seen at once if one studies the 
account which recent scholarship gives of it, and we must remember that that scholarship as 
yet only deals with a fraction of what is still lying extant and what is extant is only a small 
percentage of what was once written and known. . .. 


In each subject from the largest and most momentous to the smallest and most trivial there 
was expended the same all-embracing, opulent, minute and thorough intellectuality . . . 


Brahmagupta is a striking embodiment of this vibrant intellectuality. This ver- 
satile genius systematises arithmetic, lays the foundation of algebra, takes the 
amazing leap into Pell's equation, records an ingenious construction of rational 
cyclic quadrilaterals and the formulae for its area and diagonals, gives second 
order interpolation formula, not to mention the numerous contributions in astron- 
omy. His approach and attitude to mathematical discoveries reveal the mind and 
spirit of a great pure mathematician enthusiastically engaged in vigorous pursuit 
of new results inspired by a feel for their intrinsic mathematical worth irrespective
of the requirements of immediate applications. His mathematical foresight can
be seen in his realisation of the importance of "algebra" as a distinct branch of
mathematics. 

Just as Kalidasa is the great representative poet of the Classical Age, Brahmagupta
is its great representative mathematician; the bhāvanā shining as a crest-jewel
among his mathematical gems.
9. Varga-prakrti: Motivation and Allied Issues


Brahmagupta's possible motivation for investigating the difficult equation
$Dx^2 + 1 = y^2$ in early seventh century was discussed in chapter 7 of our earlier paper
([Du1, p 101-104]). In this section we shall examine a few passages from the book
[Vn] of van der Waerden⁶⁴, which represent an approach to the Indian history on
Pell's equation that is often made by scholars. van der Waerden asks ([Vn, p 148]):


Brahmagupta was mainly an astronomer. . . . But why should he be interested in the solution 
of Pell's equation? 


The answer can be found in Brahmagupta's text itself. As mentioned earlier (section
2), towards the end of chapter 18, Brahmagupta gives sukhamātram as the raison
d'être for the algebra problems discussed in the chapter:


These questions are stated simply for delight. 


We mention, in this context, that in a lecture on as broad a topic as Mathematics as
a basic science, Michael Atiyah⁶⁵ observed:


Number Theory for its own sake, as a great intellectual challenge, has a long history, particu- 
larly here in India. Already in the 7th century, Brahmagupta made important contributions to 
what is now known (incorrectly) as Pell’s Equation. 


As hinted in section 8, Brahmagupta belonged to a phase of Indian history where 
one sees an insatiable curiosity and an overflowing joy of creation supported by a 
[...] originality. As Sri Aurobindo observes ([A, p 401]):


When we look at the past of India, what strikes us next is her stupendous vitality, her
inexhaustible power of life and joy of life, her almost unimaginably prolific creativeness.

64 [...] van der Waerden (1903-96) was a leading Dutch mathematician of the 20th century. The
[...] his influential two-volume treatise Moderne Algebra (1930) [...] fields approach of
text books on abstract algebra. This [...] Artin, made a systematic presentation of a large
[...] Noether and Emil Artin. van der Waerden [...] to the promulgation of the ideas of
Emmy Noether, [...] also did important research in algebraic geometry [...]
Brahmagupta's delight in performing new feats in algebra was in tune with
the general creative impulse in the cultural atmosphere. After bringing clarity and 
refinement in Aryabhata’s kuttaka, the pure mathematician’s impulse drove him 
to take up the harder problem of the Pellian equation. Note that his penchant 
for number-theoretic problems also finds expression in his chapter on arithmetic 
where he blends his geometric and algebraic skill to give an ingenious construction 
of a cyclic quadrilateral whose sides, diagonals, circum-radius and area are all 
rational and whose diagonals are perpendicular to each other (and other related 
problems). While stating concrete examples, especially those involving the
varga-prakrti, Brahmagupta often used the phrase kurvannāvatsarād gaṇakaḥ: "One
who can solve it within a year (is truly a) mathematician." Clearly he revelled in the
challenge posed by the varga-prakrti, the determination of a solution of which was
described by Jayadeva ([Shu1, p 14-15]) as being as difficult as setting a fly against
the wind. 

van der Waerden overlooks all such internal evidence. He writes ([Vn, p 148]): 


The Greeks had a motive for occupying themselves with Pell's equation. The Pythagoreans
wanted to find rational approximations for what we call $\sqrt{2}$, and they found the "Side- and
Diagonal-Numbers". Archimedes needed approximations for $\sqrt{3}$, and he used for this purpose
solutions of the equations $x^2 = 3y^2 + 1$ and $x^2 = 3y^2 - 2$.


Missing from van der Waerden's discussion is the fact that, right from the Vedic
times, there have been efforts by Indian mathematicians to obtain convenient,
reasonably accurate, rational approximations to irrational surds $\sqrt{D}$. The approximation
$1 + \frac{1}{3} + \frac{1}{3 \cdot 4} - \frac{1}{3 \cdot 4 \cdot 34}$ $\left(= \frac{577}{408}\right)$ for $\sqrt{2}$ was known in the Vedic era and
recorded by three authors of the Śulba-sūtras: Baudhāyana, Āpastamba and
Kātyāyana.⁶⁷ As we mentioned in section 1, the denominator and numerator of
the fraction $\frac{577}{408}$ satisfy the Pell equation $2x^2 + 1 = y^2$.⁶⁸ A few examples of
rational approximations occurring in post-Vedic texts were mentioned in ([Du1,
p 102-103]). As mentioned in section 7, the algebraist Narayana explicitly used
large solutions of $Dx^2 + 1 = y^2$ for approximating $\sqrt{D}$. None of these facts find
mention in van der Waerden's book published in 1983. van der Waerden continues
([Vn, p 149; 154]):


We also have seen that the methods for obtaining these solutions are closely connected with 
the Euclidean algorithm, applied to irrational ratios of line segments. ... But why should 
Brahmagupta explain these methods, stripped of their original context, in an astronomical 


67 In as early a text as that of Dickson ([Di, p 341]), the Śulba fractions giving approximations to $\sqrt{2}$
are mentioned at the beginning of the chapter "Pell equation".

68 Also recall (section 7) that the pair (408, 577) is related to the minimal solution (2, 3) by
Brahmagupta's identity $(2, 3; 1) \odot (2, 3; 1) \odot (2, 3; 1) \odot (2, 3; 1) = (408, 577; 1)$.
treatise? I can find only one explanation: he followed an earlier tradition ultimately derived
from Greek sources.


I suppose that the Greeks were able to solve Pell's equation, not only for $D = 2$ and $D = 3$,
but also for higher values of $D$, by a systematic application of the Euclidean algorithm. I also
suppose that their methods of calculation were copied, without proofs, in Hindu treatises like
Brahmagupta's Siddhanta.


Now this is the analysis of a scholar as erudite and meticulous as van der Waerden. 
What goes wrong? This is not an isolated instance. P. Tannery asserted in 1882 that 
Indians must have got the solution of Pell's equation from Greeks (who must have 
solved it before), several historians repeated this assertion in some form or the other, 
and, a hundred years after Tannery, the writings of van der Waerden (1983) convey a
similar impression.⁶⁹ Perhaps, as hinted in section 8, consciously or subconsciously,
a topic like Pell’s equation was considered to be too sophisticated for any ancient 
mathematician hailing from, what was perceived to be, an other-worldly culture. In 
this connection, we make a few remarks. 


(i) Like Archimedes before him, or Newton and Gauss after him, Brahmagupta 
was a veritable colossus of his time. This perception is yet to sink in, in the circle 
of history of mathematics. Anyone familiar with his mathematical gifts would not 
need to postulate a preceding hypothetical text from where he could have “copied” 
("without proof"!) Theorem 1. It is a reflection of the opulent intellectuality and
the inexhaustible vital creativeness of the Classical Age that Brahmagupta and 
some of his successors were doing prolific research in astronomy as well as in pure 
mathematics with equal vigour and enthusiasm. 


(ii) The Indian achievements on Pell’s equation tend to overshadow the fact that 
ancient Indian algebraists had produced a large bulk of work involving ingenious 
solutions of various types of indeterminate equations (see [DS2] for some examples). 
Most of these equations were clearly investigated for their own sake. While this 
zest for exploration of higher degree indeterminate equations (after solving the 
application-oriented linear case) fits into the general spirit of the era, it was also 
part of a process of infusing a strong algebra-culture in mathematical thinking. 
We have seen in section 2 that astronomer-mathematicians like Brahmagupta and 
Bhaskara II, and mathematicians like Narayana, who pursued or discussed the 
equation $Dx^2 + 1 = y^2$ explicitly emphasised the importance of algebra in their
respective texts. Their statements indicate their passion for algebra as also the 
perception that the cultivation of the subject was a means for sharpening and 
enriching the intellect. 


69 In ([Sr2, p 17-18]), R. Sridharan quotes similar statements from a paper of van der Waerden as an
example of the prejudice of some mathematicians.
Just as Fermat made use of Pell’s equation to create interest in number theory,
Brahmagupta and Bhaskara II too made use of challenging problems in indeter- 
minate equations (among other topics) to promote algebra among students with 
promising intellects. 

(iii) In the context of the last sentence quoted above from van der Waerden, 
we note that proof (or its absence) can be an issue in the context of cakravāla,
not bhāvanā. Brahmagupta’s section Varga-prakrti consists of Theorem 1 which
is, in essence, an algebraic identity, and its repeated applications. All these results, 
once stated, can be verified by routine algebraic operations; and, as explained at 
the beginning of section 5, routine details are not to be expected in a terse (but still 
voluminous) original ancient Indian treatise. Readers can judge the validity of the 
quoted speculation by seeing the phrase about some Greek methods being copied 
without proofs in Brahmagupta’s treatise in the light of the actual contents of the 
treatise. 
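
The point that such results can be verified by routine algebra may be illustrated numerically. The sketch below assumes that Theorem 1 is the composition law (bhāvanā) in its usual form: if $Dx_1^2 + m_1 = y_1^2$ and $Dx_2^2 + m_2 = y_2^2$, then $D(x_1 y_2 + x_2 y_1)^2 + m_1 m_2 = (Dx_1 x_2 + y_1 y_2)^2$. The function names `bhavana` and `is_triple` are hypothetical helpers introduced only for this illustration.

```python
from fractions import Fraction


def bhavana(t1, t2, D):
    """Compose two triples (x, y; m) satisfying D*x^2 + m = y^2.
    (Hypothetical helper illustrating the composition law.)"""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


def is_triple(t, D):
    """Check the defining relation D*x^2 + m == y^2."""
    x, y, m = t
    return D * x * x + m == y * y


D = 2
base = (2, 3, 1)              # 2*2^2 + 1 = 3^2
t = base
for _ in range(3):            # composing four copies of (2, 3; 1)
    t = bhavana(t, base, D)
    assert is_triple(t, D)
print(t)                      # (408, 577, 1)

# The Sulba approximation 1 + 1/3 + 1/(3*4) - 1/(3*4*34) equals 577/408.
sulba = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)
print(sulba)                  # 577/408
```

The intermediate triples are (12, 17; 1) and (70, 99; 1); each satisfies $2x^2 + 1 = y^2$, and the ratios 17/12, 99/70, 577/408 approach $\sqrt{2}$.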


(iv) We have seen that Indian history on Pell’s equation shows a gradual devel- 
opment: first there is kuttaka; then the composition law, rational solution and partial 
integer solution to Pell’s equation; and then the complete integer solution (involving 
all preceding ideas). It is generally agreed that Greek mathematics had stagnated 
by the 5th-6th century CE; that is before Brahmagupta. If the Greeks had solved 
the problem before Brahmagupta, and the solution had reached India, why does 
Brahmagupta record (or “copy”) only a partial solution? Or should one presume 
that, by some combination of circumstances, only an initial part of the hypothetical 
Greek treatment on Pell's equation had reached Brahmagupta, and a subsequent 
part reached Jayadeva after a few centuries? 

We mention here that it is only in certain aspects of astrology and astronomy that 
historians have traced a possible transmission of ideas from Greek (or Hellenistic) 
schools to the Indian treatises. There is no influence of the “pure mathematics" of 
the Greeks in the developments in post-Vedic Indian mathematics. Now if there has
not been transmission of something as standard as Greek geometry⁷⁰, what were the
chances of transmission of a topic as esoteric (for the times) as the integer solutions 
of the Pellian equation from the hypothetical Greek sources (and that too in at least 
two phases — one for Brahmagupta, one for Jayadeva)? 

On the one hand, there is concrete textual evidence of research on Pell’s equation 
progressing in phases in ancient India in an environment with a good system of 
enumeration and a clarity regarding formation and handling of equations, with 


70 Indians had a strong culture of systematisation of various branches of knowledge. Mathematicians
belonging to an intellectual environment that was used to the structure of Panini’s Aṣṭādhyāyī would
have found the postulational style of Greek geometry suited to their temperament, had they got a glimpse 
of it. Someone like Brahmagupta, who could come up with gems on the cyclic quadrilateral, would have 
much more geometry in his treatise if only Greek geometry had reached him. The efforts of Mahavira 
show that Indians did have curiosity regarding conics. 
a love for large numbers and a love for algebra (cf. (vi) and (vii) of section 7).
On the other, there is as yet no concrete evidence of any Greek mathematician
even investigating integer solutions of Pell's equation for large D and no serious
evidence of transmission to India of mathematical ideas that were not intimately
connected to astronomy. And yet, there has been a tendency to deny the originality
of Brahmagupta and his successors regarding their work on varga-prakrti, or to
label it, euphemistically, as "disputed". As H.C. Williams recently put it ([Wi, p
404]):


Finally, there is the belief, apparently due to Tannery, that the cyclic method derives from
Greek influences. There seems, apart from possible wishful thinking on the part of Tannery,
to be little solid evidence in support of this. The simple fact is that, as mentioned earlier,
we don't really know what the Greeks knew about the Pell equation. What we do know,
however, is that the Indian methods display a history of steady development and refinement
up to and including the discovery of the cyclic method, and this very strongly suggests
that Hankel's position that the Indians evolved the technique by themselves is the correct
one.


It is to be hoped that the seminar series on History of Mathematics at the Chennai
Mathematical Institute will be a precursor to more such congregations in India of 
leading mathematicians and historians of mathematics, and that such interactions 
will pave the way for a more accurate, more comprehensive, more representative 
and more insightful account on history of mathematics in the near future. 


Appendix: The Original Verses

[...] we refrain from quoting original verses for all the statements from
[...] which describe the three principles kuttaka, bhāvanā and cakravāla. English
translations for the quoted verses can be found in Dias. p 74-84] and [Ke, p 128] [...]


Brahmagupta’s verses on Bhavana 


mūlaṃ dvidheṣṭavargād guṇakaguṇādiṣṭayutavihīnācca
ādyavadho guṇakaguṇaḥ sahāntyaghātena kṛtamantyam
vajravadhaikyaṃ prathamaṃ prakṣepaḥ kṣepavadhatulyaḥ


Jayadeva’s verses on Cakravala 


hrasvajyesthaksepan pratirá$ya ksepabhaktayoh ksepat 
kuttakare ca krte kiyadgunarh ksepakarn ksiptva
tavatkrteh prakrtya hine praksepakena sambhakte 
svalpataravaptih syadityadkalito’ parah ksepah 
praksiptapraksepakakuttakare kanisthamülahate 
sajyesthapade praksep(ak)ena labdham kanisthapadam 
ksiptaksepakakuttagunitáttasmatkanisthamülahatam 
pascatyam praksepam vi$odhya $esarn mahanmülam 
kuryat kuttákaram punaranayoh ksepabhaktayoh padayoh 
tatsestahataksepe sadrSagune’smin prakrtihine 
praksepah ksepapte praksiptaksepakacca gunakarat 
alpaghnat sajyesthat ksepavaptam kanisthapadam 
etatksiptaksepakakuttakaghatadanantaraksepam 
hitva'lpahatarn Sesarh jyestham tebhyasca gunakadi 
kuryattavadyavat sanndmekadvicatumam patati 

iti cakravalakarane’vasarapraptani yojyani 
Development of Calculus in India


K. Ramasubramanian and M. D. Srinivas 


In this article we shall present an overview of the development of calculus in Indian mathematical 
tradition. The article is divided naturally into two parts. In the first part we shall discuss the de- 
velopments during what may be called the classical period, starting with the work of Aryabhata 
(c. 499 CE) and extending up to the work of Narayana Pandita (c. 1350). The work of the Kerala
School starting with Madhava of Sangamagrama (c. 1350), which has a more direct bearing 
on calculus, will be dealt with in the second part. Here we shall discuss some of the contributions 
of the Kerala School during the period 1350-1500 as outlined in the seminal Malayalam work 
Yuktibhasa of Jyesthadeva (c. 1530). 


PART I : THE CLASSICAL PERIOD 
Aryabhata to Narayana Pandita (c. 500-1350 CE) 


1. Introduction 


In his pioneering history of calculus written sixty years ago, Carl Boyer was totally
dismissive of the Indian contributions to the conceptual development of the
subject.¹ Boyer's historical overview was written around the same time when (i)
Ramavarma Maru Thampuran and Akhileswarayyar brought out the first edition
of the Mathematics part of the seminal text Ganita-yukti-bhasa, and (ii) C.T. Rajagopal
and his collaborators, in a series of pioneering studies, drew attention [to]
the significance of the results and techniques outlined in Yuktibhāṣā (and [...]
of the Kerala School of Mathematics in general), which seem to [...]ten
after the initial notice by Charles Whish in early nineteenth century. [...]
the subsequent studies have led to a somewhat different perception [...]
We have here a prime example of two traditions whose aims were completely
different. The Euclidean ideology of proof which was so influential in the Islamic
world had no apparent influence in India (as al-Biruni had complained long
before), even if there is a possibility that the Greek tables of ‘trigonometric
functions’ had been transmitted and refined. To suppose that some version of
‘calculus’ underlay the derivation of the series must be a matter of conjecture.

The single exception to this generalization is a long work, much admired in
Kerala, which was known as Yuktibhasa, by Jyesthadeva; this contains something
more like proofs, but again, given the different paradigm, we should be
cautious about assuming that they are meant to serve the same functions. Both
the authorship and date of this work are hard to establish exactly (the date usually
claimed is the sixteenth century), but it does give explanations of how the
formulae are arrived at which could be taken as a version of the calculus.


The Malayalam work Ganita-yukti-bhasa (c. 1530) of Jyesthadeva indeed
presents an overview of the work of Kerala School of mathematicians during
the period 1350-1500 CE. The Kerala School was founded by Madhava
(c. 1340-1420), who was followed by the illustrious mathematician-astronomers
Parameśvara (c. 1380-1460), his son Damodara and the latter's student
Nilakantha Somayaji (c. 1444-1550). While the achievements of the Kerala
School are indeed spectacular, it has now been generally recognised that these are
in fact very much in continuation with the earlier work of Indian mathematicians,
especially of the Aryabhatan school, during the period 500-1350 CE.
In the first part of this article, we shall consider some of the ideas and methods
developed in Indian mathematics, during the period 500-1350, which have a
bearing on the later work of the Kerala School. In particular, we shall focus on the
following topics: Mathematics of zero and infinity; iterative approximations for
irrational numbers; summation (and repeated summations) of powers of natural
numbers; use of second-order differences and interpolation in the calculation of
[...] or Rsines; the emergence of the notion of instantaneous velocity of a planet in
astronomy; and the calculation of the surface area and volume of a sphere.
from [that] pūrṇa; even when pūrṇa is drawn out of pūrṇa, what remains is
also pūrṇa.


Panini's Aṣṭādhyāyī (c. 500 BCE) has the notion of lopa which functions as a
null-morpheme. Lopa appears in seven sūtras of Chapters 1, 3, 7, starting with


अदर्शनं लोपः। (1.1.60).


Śūnya appears as a symbol in Pingala’s Chandaḥ-sūtra (c. 300 BCE). In Chapter
VIII, while enunciating an algorithm for evaluating any positive integral power
of 2 in terms of an optimal number of squaring and multiplication (duplication)
operations, śūnya is used as a marker:


रूपे शून्यम्। द्विः शून्ये। (8.29-30).
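
The squaring-and-duplication scheme referred to here can be sketched as follows. This is a hedged illustration that assumes the commonly reported reading of the sūtras: when the exponent is even it is halved and a mark "2" is recorded, when it is odd one is subtracted and a śūnya is recorded, and the marks are then read in reverse, doubling at each śūnya and squaring at each "2". The function name `pingala_power_of_two` and the marker strings are hypothetical choices made for this sketch.

```python
def pingala_power_of_two(n):
    """Compute 2**n using only doubling and squaring, guided by a list
    of markers. (Hypothetical helper; marker semantics are an assumption
    based on the commonly reported reading of Chandah-sutra 8.28-31.)"""
    if n < 0:
        raise ValueError("n must be non-negative")
    markers = []
    while n > 0:
        if n % 2 == 0:
            n //= 2
            markers.append("dvi")     # even: halve, record "2" (square later)
        else:
            n -= 1
            markers.append("sunya")   # odd: subtract one, record zero (double later)
    result = 1
    for mark in reversed(markers):
        if mark == "sunya":
            result *= 2               # duplication
        else:
            result *= result          # squaring
    return result


print(pingala_power_of_two(5))  # 32
```

For n = 5 the markers, in the order recorded, are śūnya, 2, 2, śūnya; read in reverse they give 1 → 2 → 4 → 16 → 32, so the fifth power is reached with two doublings and two squarings.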


Different schools of Indian philosophy have related notions such as the notion of
abhāva in Nyāya School, and the śūnyavāda of the Bauddhas.


2.2. Mathematics of zero in Brahmasphuta-siddhanta (c. 628 CE) of 
Brahmagupta 


The Brahmasphuta-siddhanta (c. 628 CE) of Brahmagupta seems to be the first
available text that discusses the mathematics of zero. Śūnya-parikarma or the
six operations with zero are discussed in the chapter XVIII on algebra
(kuṭṭakādhyāya), in the same six verses in which the six operations with positives and
negatives (dhanarṇa-ṣaḍvidha) are also discussed. Zero divided by zero is stated
to be zero. Any other quantity divided by zero is said to be taccheda (that with
zero-denominator):³


3 Brahmasphutasiddhanta of Brahmagupta, Ed. with his own commentary by Sudhakara 
Dvivedi, Benaras 1902, verses 18.30-35, pp. 309-310. 
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| GI Vs ar aura VAS ari 

RAGA: du wea ws qug aa 

... [The sum of] positive (dhana) and negative (rna), if they are equal, is zero (kham). The sum of a negative and zero is negative, of a positive and zero is positive, and of two zeros, zero (sunya).

... Negative subtracted from zero is positive, and positive from zero is negative. Zero subtracted from negative is negative, from positive is positive, and from zero is zero (akasa). ... The product of zero and a negative, of zero and a positive, or of two zeroes is zero.

... A zero divided by zero is zero. ... A positive or a negative divided by zero is that with zero-denominator.


2.3. Bhāskarācārya on Khahara


Bhaskaracarya II (c. 1150), while discussing the mathematics of zero in Bijaganita, explains that infinity (ananta-rasi), which results when some number is divided by zero, is called khahara. He also mentions the characteristic property of infinity that it is unaltered even if 'many' are added to or taken away from it, in terms similar to the invocatory verse of Isavasyopanisad mentioned above:4


[Sanskrit verses]


[A quantity] divided by zero will be (called) khahara (an entity with zero as divisor) ... divided by zero ... This infinite (ananta, or that without end) ... there is no alteration even if many are added or taken away ... in the Infinite (ananta), Infallible (acyuta) ... groups of beings enter in or emanate from [It] at ...


... zero in Lilavati, notes that ... being multiplied by zero ... the quantity ...
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He follows this up with an example and declares that this kind of calculation has 
great relevance in astronomy: 


[Sanskrit verses]


... A quantity multiplied by zero is zero. But it must be retained as such when further operations [involving zero] are contemplated. When zero is the multiplier of a quantity, if zero also happens to be a divisor, then that quantity remains unaltered ...

... What is the number which when multiplied by zero, being added to half of itself, multiplied by 3 and divided by zero, amounts to sixty-three?

... Either following the inverse process or by choosing a desired number for the ista-karma ('rule of false position'), the quantity is obtained to be 14. This kind of calculation is of great use in mathematical astronomy.




Bhāskara works out his example as follows: 
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a wqniwfedt vf want aT ga: | 
MURA CHAS SIT: WAM ATA Il 
Say what is the number which when added to half of itself, multiplied by zero, squared and the square being augmented by twice its root and divided by zero, becomes fifteen?


Clearly the problem as stated is

$$\frac{\left[0\left(x + \frac{x}{2}\right)\right]^2 + 2\left[0\left(x + \frac{x}{2}\right)\right]}{0} = 15. \qquad (2)$$

Bhaskara in his Vasana seems to just cancel out the zeros without paying any heed to the different powers of zero involved. He converts the problem into the equation

$$\left[x + \frac{x}{2}\right]^2 + 2\left[x + \frac{x}{2}\right] = 15. \qquad (3)$$

Solving this by the method of elimination of the middle term, Bhaskara obtains the solution x = 2. The other solution (x = -10/3) is not noted.
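The two zero-problems quoted above can be checked with a few lines of exact arithmetic. This is only an illustrative sketch: it assumes, as the text describes, that the zero multiplier and the zero divisor are simply cancelled, and the function names are hypothetical.

```python
from fractions import Fraction
import math

def first_example():
    # "x times zero, plus half of that, times 3, divided by zero, is 63."
    # After cancelling the zeros this reads (x + x/2) * 3 = 63.
    return Fraction(63) / (Fraction(3, 2) * 3)

def second_example_roots():
    # With u = x + x/2, equation (3) becomes u**2 + 2*u = 15.
    disc = 2 * 2 + 4 * 15            # discriminant of u^2 + 2u - 15 = 0
    root = math.isqrt(disc)          # 64 is a perfect square
    us = [Fraction(-2 + root, 2), Fraction(-2 - root, 2)]
    # Convert back from u = (3/2) x to x.
    return [u / Fraction(3, 2) for u in us]
```

The first function returns 14, the value stated in the text; the second returns both roots of the quadratic, of which only the positive one is mentioned by Bhaskara.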


3. Irrationals and iterative approximations 


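The Baudhayana-sulva-sutra gives the following approximation for $\sqrt{2}$:7

[Sanskrit verse]


[The measure] is to be increased by its third and this [third] again by its own fourth less the thirty-fourth part [of the fourth]. That is the approximate diagonal.

$$\sqrt{2} \approx 1 + \frac{1}{3} + \frac{1}{3 \cdot 4} - \frac{1}{3 \cdot 4 \cdot 34} = \frac{577}{408} \approx 1.4142156. \qquad (4)$$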
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The above approximation is accurate to 5 decimal places. Baudhayana-sulva- 
sutra—in the context of discussing the problem of circling a square—also gives 
an approximation for $\pi$:8


If it is desired to transform a square into a circle, [a cord of length] half the 
diagonal of the square is stretched from the centre to the east; with one-third [of 
the part lying outside] added to the remainder [of the half-diagonal] the [required] 
circle is drawn. 


If a is half the side of the square, then the radius r of the circle is given by

$$r = \left(\frac{a}{3}\right)\left(2 + \sqrt{2}\right). \qquad (5)$$

This corresponds to $\pi \approx 3.0883$.
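The two sulva-sutra values can be reproduced numerically. The sketch below assumes the usual reading of the rule for the diagonal (one, plus a third, plus a fourth of the third, less a thirty-fourth of that fourth) and equates the area of the circle of radius r with that of the square of side 2a; the names are illustrative only.

```python
from fractions import Fraction
import math

# Approximation to sqrt(2) built up term by term, kept as an exact fraction.
sqrt2_sulva = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)

def pi_from_circled_square(a=1.0):
    # Radius from the circling rule: r = (a/3) * (2 + sqrt(2)).
    r = (a / 3) * (2 + math.sqrt(2))
    # Square of side 2a and circle of radius r are taken to have equal area.
    return (2 * a) ** 2 / r ** 2
```

The fraction comes out as 577/408, and the implied value of the circle constant is close to 3.088, as quoted in the text.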


3.2. Algorithm for square-roots in Aryabhatiya


The Aryabhatiya of Aryabhata (c. 499 CE) gives a general algorithm for computing the successive digits of the square root of a number. The procedure, given in the following verse, is elucidated by us via an example:9


[Sanskrit verse]


Always divide the non-square (even) place by twice the square-root [already found]. Having subtracted the square [of the quotient] from the square (odd) place, the quotient gives the [digit in the] next place in the square-root.

[Worked example (digit-by-digit layout) not reproduced.]


9 Ganitapada 4, p. 36.
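A digit-by-digit square-root extraction of the kind described in the verse can be sketched as follows. This is the modern paired-digit variant, not a transcription of Aryabhata's place-by-place layout, and the function name is hypothetical.

```python
def digitwise_sqrt(n):
    """Return (root, remainder) with root**2 + remainder == n."""
    digits = str(n)
    if len(digits) % 2:
        digits = "0" + digits          # group digits in pairs from the right
    root, remainder = 0, 0
    for i in range(0, len(digits), 2):
        # Bring down the next pair of digits.
        remainder = remainder * 100 + int(digits[i:i + 2])
        # Largest digit d with (20*root + d)*d <= remainder;
        # 20*root is "twice the root already found", shifted one place.
        d = 9
        while (20 * root + d) * d > remainder:
            d -= 1
        remainder -= (20 * root + d) * d
        root = root * 10 + d
    return root, remainder
```

For instance, `digitwise_sqrt(55225)` yields the root 235 with no remainder (this test number is chosen here for illustration and is not taken from the text).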
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THAMES qu edm 
He sm fear fasta i 


Multiply the non-square number by some large square number, take the square-root [of the product] neglecting the remainder, and divide by the square-root of the multiplier.
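The rule just quoted translates almost word for word into code. In this sketch the "large square number" is taken to be `scale**2`, and both the name `approx_root` and the choice of scale are illustrative assumptions.

```python
import math
from fractions import Fraction

def approx_root(n, scale):
    # Multiply n by the square scale**2, take the integer square root
    # (neglecting the remainder), then divide by scale, the root of the multiplier.
    return Fraction(math.isqrt(n * scale * scale), scale)
```

With `n = 2` and `scale = 10**4` this gives 14142/10000; a larger multiplier gives proportionally more correct digits.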


Narayana Pandita (c. 1356) has noted that the solutions of varga-prakrti (the 
so called Pell’s equation) can be used to compute successive approximations to the 
square-root of a non-square number:! 


[Sanskrit verse]


[With the number] whose square-root is to be found as the prakrti and unity as the ksepa, [obtain the greater and smaller] roots. The greater root divided by the lesser root is an approximate value of the square-root.


Narayana considers the example

$$10x^2 + 1 = y^2, \qquad (6)$$

and gives the approximate values

$$\sqrt{10} \approx \frac{19}{6},\quad \frac{721}{228},\quad \frac{27379}{8658},$$

which are obtained by successive compositions (bhavana) of the solution x = 6, y = 19: $228 = 2(6)(19)$, $721 = (10)(6)^2 + (19)^2$, and so on.
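Narayana's successive approximations can be regenerated by composing solutions of the equation $N x^2 + 1 = y^2$. The sketch below uses the standard composition rule for two solutions; the function names are hypothetical.

```python
from fractions import Fraction

def bhavana(p, q, N):
    # Compose two solutions (x, y) of N*x**2 + 1 == y**2 into a third one.
    (x1, y1), (x2, y2) = p, q
    return (x1 * y2 + x2 * y1, y1 * y2 + N * x1 * x2)

def root_approximations(N, seed, count):
    # Repeatedly compose with the seed solution; y/x approximates sqrt(N).
    out, cur = [], seed
    for _ in range(count):
        out.append(Fraction(cur[1], cur[0]))
        cur = bhavana(cur, seed, N)
    return out
```

Calling `root_approximations(10, (6, 19), 3)` returns 19/6, 721/228 and 27379/8658, the three values given in the text.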


The Aryabhatiya gives the following approximate value for $\pi$:


[Sanskrit verse]


... Part II, Benaras 1942,
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One hundred plus four multiplied by eight and added to sixty-two thousand: This 
is the approximate measure of the circumference of a circle whose diameter is 
twenty-thousand. 


Thus, as per the above verse, $\pi \approx \frac{62832}{20000} = 3.1416$.


3.5. Successive doubling of the sides of the circumscribing polygon 


It appears that Indian mathematicians (at least in the Aryabhatan tradition) 
employed the method of successive doubling of the sides of a circumscribing 
polygon—starting from the circumscribing square leading to an octagon, etc.— 
to find successive approximations to the circumference of a circle. This method 
has been described in the later Kerala texts Yuktibhasa (c. 1530) of Jyesthadeva 
and Kriyakramakari commentary (c. 1535) of Sankara Variyar on the Lilavati of Bhaskara II. The latter cites the verses of Madhava (c. 1340-1420) in this connection and notes at the end that:14


[Sanskrit verse]
Thus, one can obtain [an approximation to the circumference of the circle] to any 
desired level of accuracy. 
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We now outline this method as described in Yuktibhasa.'> In Figure 1, EOSA 
is the first quadrant of the square circumscribing the given circle. $EA_1$ is half the side of the circumscribing square. Let $OA_1$ meet the circle at $C_1$. Draw $A_2C_1B_2$ parallel to ES. $EA_2$ is half the side of the circumscribing octagon.

Similarly, let $OA_2$ meet the circle at $C_2$. Draw $A_3C_2B_3$ parallel to $EC_1$. $EA_3$ is now half the side of a circumscribing regular polygon of 16 sides. And so on. Let half the sides of the circumscribing square, octagon etc., be denoted

$$l_1 = EA_1,\quad l_2 = EA_2,\quad l_3 = EA_3,\ \ldots \qquad (8)$$

The corresponding karnas (diagonals) are

$$k_1 = OA_1,\quad k_2 = OA_2,\quad k_3 = OA_3,\ \ldots \qquad (9)$$

And the abadhas (intercepts) are

$$a_1 = D_1A_1,\quad a_2 = D_2A_2,\quad a_3 = D_3A_3,\ \ldots \qquad (10)$$

Now

$$l_1 = r,\quad k_1 = \sqrt{2}\,r \quad\text{and}\quad a_1 = \frac{r}{\sqrt{2}}. \qquad (11)$$

Using the bhuja-koti-karna-nyaya (Pythagoras theorem) and trairasika-nyaya (rule of three for similar triangles), it can be shown that

$$l_2 = l_1 - (k_1 - r)\,\frac{l_1}{a_1}, \qquad (12)$$

$$k_2^2 = r^2 + l_2^2, \qquad (13)$$

$$a_2 = \frac{k_2^2 - (r^2 - l_2^2)}{2k_2}. \qquad (14)$$
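The doubling step can be iterated numerically. The sketch below assumes that the step from the square to the octagon repeats unchanged at every stage (with the intercept taken as the square of the half-side divided by the karna); that generalisation, the function name and the use of floating point are assumptions made for illustration.

```python
import math

def circumscribed_pi(doublings, r=1.0):
    """Estimate of circumference/diameter from a circumscribing polygon."""
    l = r            # half-side of the circumscribing square
    sides = 4
    for _ in range(doublings):
        k = math.hypot(r, l)         # karna: k^2 = r^2 + l^2
        a = l * l / k                # intercept (abadha) on the karna
        l = l - (k - r) * l / a      # half-side of the polygon with twice the sides
        sides *= 2
    # Perimeter = sides * 2l, diameter = 2r.
    return sides * l / r
```

Zero doublings gives 4 (the square), one doubling gives about 3.3137 (the octagon), and ten doublings already agree with the true value to about six decimal places. Because the expression `k - r` loses precision when the half-side becomes very small, this floating-point version should not be pushed to a very large number of doublings.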
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4. Summation (and repeated summations) of powers of natural numbers 
(sankalita) 


4.1. Sum of squares and cubes of natural numbers in Aryabhatiya 
The ancient text Brhaddevata (c. 5th century BCE) has the result

$$2 + 3 + \ldots + 1000 = 500{,}499. \qquad (16)$$


Aryabhata (c. 499 CE), in the Ganitapada of Aryabhatiya, deals with a general 
arithmetic progression in verses 19-20. He gives the sum of the squares and cubes of natural numbers in verse 22:17


[Sanskrit verse]


The product of the three quantities, the number of terms plus one, the same in- 
creased by the number of terms, and the number of terms, when divided by six, 
gives the sum of squares of natural numbers (varga-citi-ghana). The square 
of the sum of natural numbers gives the sum of the cubes of natural numbers 
(ghana-citi-ghana). 


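In other words,

$$1^2 + 2^2 + 3^2 + \ldots + n^2 = \frac{n(n+1)(2n+1)}{6}, \qquad (17)$$

$$1^3 + 2^3 + 3^3 + \ldots + n^3 = (1 + 2 + 3 + \ldots + n)^2 = \left[\frac{n(n+1)}{2}\right]^2. \qquad (18)$$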

4.2. Repeated sum of natural numbers in Aryabhatiya


Aryabhata also gives the repeated sum of the sum of the natural numbers (sankalita-sankalita or vara-sankalita):18


[Sanskrit verse]

Of the series (upaciti) 1, 2, ..., n, take three terms in continuation of which the first is the given number of terms (gaccha), and find their product; that [product], or the number of terms plus one subtracted from its own cube, divided by six, gives the repeated sum.



212 K. Ramasubramanian and M. D. Srinivas 
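We have

$$1 + 2 + 3 + \ldots + n = \frac{n(n+1)}{2}. \qquad (19)$$

Aryabhata's result expresses the sum of these triangular numbers in two forms:

$$\frac{1 \cdot 2}{2} + \frac{2 \cdot 3}{2} + \ldots + \frac{n(n+1)}{2} = \frac{n(n+1)(n+2)}{6} = \frac{(n+1)^3 - (n+1)}{6}. \qquad (20)$$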
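The equality of the two forms of the repeated sum, and their agreement with a direct sum of triangular numbers, can be verified as follows (function names are illustrative).

```python
def triangular(k):
    return k * (k + 1) // 2

def repeated_sum(n):
    # Product of three consecutive terms starting at n, divided by six.
    return n * (n + 1) * (n + 2) // 6

def repeated_sum_alt(n):
    # (n + 1) subtracted from its own cube, divided by six.
    return ((n + 1) ** 3 - (n + 1)) // 6
```

For n = 4 both forms give 20, which is 1 + 3 + 6 + 10.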


4.3. Narayana Pandita's general formula for Varasankalita


In his Ganita-kaumudi, Narayana Pandita (c. 1356) gives the formula for the r-th order repeated sum of the sequence of numbers 1, 2, 3, ..., n:19


[Sanskrit verse]


The pada (number of terms in the sequence) is the first term [of an arithmetic progression] and 1 is the common difference. Take as numerators [the terms in the AP] numbering one more than vara (the number of times the repeated summation is to be made). The denominators are [terms of an AP of the same length] starting with one and with common difference one. The resultant product is vara-sankalita.


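Let

$$1 + 2 + 3 + \ldots + n = \frac{n(n+1)}{2} = V_n^{(1)}.$$

Then Narayana's result is

$$V_n^{(r)} = V_1^{(r-1)} + V_2^{(r-1)} + \ldots + V_n^{(r-1)} = \frac{n(n+1)\cdots(n+r)}{1 \cdot 2 \cdots (r+1)}.$$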


Development of Calculus in India 213 


This result can be used to evaluate the sums $\sum_{k=1}^{n} k^2$, $\sum_{k=1}^{n} k^3$, ... by induction. It can also be used to estimate the behaviour of these sums for large n.
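Narayana's rule (numerators n, n+1, ..., n+r over denominators 1, 2, ..., r+1) can be compared with a brute-force repeated summation. The sketch below is illustrative; the function names are hypothetical, and the product is recognised as a binomial coefficient only as a cross-check.

```python
from math import comb, prod

def vara_sankalita(n, r):
    # Numerators: r+1 terms of the AP n, n+1, ...; denominators: 1, 2, ..., r+1.
    numerator = prod(n + i for i in range(r + 1))
    denominator = prod(range(1, r + 2))
    return numerator // denominator

def vara_sankalita_brute(n, r):
    # Take partial sums of 1, 2, ..., n a total of r times; return the last entry.
    terms = list(range(1, n + 1))
    for _ in range(r):
        running, sums = 0, []
        for t in terms:
            running += t
            sums.append(running)
        terms = sums
    return terms[-1]
```

For r = 1 this is the ordinary sum n(n+1)/2, and for r = 2 it is Aryabhata's repeated sum n(n+1)(n+2)/6.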


4.4. Summation of geometric series 


The geometric series $1 + 2 + 2^2 + \ldots + 2^n$ is summed in Chapter VIII of Pingala's Chandah-sutra (c. 300 BCE). As we mentioned earlier, Pingala also gives an algorithm for evaluating any positive integral power of a number (2 in this context) in terms of an optimal number of squaring and multiplication operations.

Mahaviracarya (c. 850), in his Ganita-sara-sangraha, gives the sum of a geometric series and also explains the Pingala algorithm for finding the required power of the common ratio between the terms of the series:20


[Sanskrit verses]

The first term when multiplied by the product of the common ratio (guna) taken as many times as the number of terms (pada), gives rise to the gunadhana. This gunadhana, when diminished by the first term and divided by the common ratio less one, is to be understood as the sum of the geometrical series (guna-sankalita).

That is,

$$a + ar + ar^2 + \ldots + ar^{n-1} = \frac{a(r^n - 1)}{(r - 1)}. \qquad (25)$$
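The two ingredients described above, a power computed by repeated halving and squaring and the sum obtained from the gunadhana, can be put together in a short sketch. It is restricted to integer a and r with r different from 1, and the function names are illustrative.

```python
def power(base, n):
    # Square-and-multiply: halve an even exponent and square the result,
    # reduce an odd exponent by one and multiply once by the base.
    if n == 0:
        return 1
    if n % 2 == 0:
        half = power(base, n // 2)
        return half * half
    return base * power(base, n - 1)

def geometric_sum(a, r, n):
    # gunadhana = a * r**n; subtract the first term, divide by (r - 1).
    gunadhana = a * power(r, n)
    return (gunadhana - a) // (r - 1)
```

For example, `geometric_sum(1, 2, 11)` gives 2047, the sum of the eleven terms 1 + 2 + ... + 2^10.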


Virasena (c. 816), in his commentary Dhavala on the Satkhandagama, has made use of the sum of the following infinite geometric series in his evaluation of the volume of the frustum of a right circular cone:21

$$1 + \frac{1}{4} + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n + \ldots = \frac{4}{3}. \qquad (26)$$

The proof of the above result is discussed in the Aryabhatiya-bhasya (c. 1502) of Nilakantha Somayaji. As we shall see later (section 10.1), Nilakantha makes use of this series for deriving an approximate expression for a small arc in terms of the corresponding chord in a circle.


20 Ganitasarasangraha of Mahaviracarya, Ed. by Lakshmi Chanda Jain, Sholapur 1963, verses 2.93-94, pp. 28-29.

21 See, for instance, T. A. Sarasvati Amma, Geometry in Ancient and Medieval India, Motilal Banarsidass, Delhi 1979, Rep. 2007, pp. 203-05.
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5. Use of Second-order differences and interpolation in computation of 
Rsines (Jyanayana) 


Jya, Koti and Sara


The jya or bhuja-jya of an arc of a circle is actually the half-chord (ardha-jya or jyardha) of double the arc. In Figure 2, if R is the radius of the circle, jya (Rsine), koti or koti-jya (Rcosine) and sara (Rversine) of the capa (arc) EC are given by:

$$\text{jya}\,(\text{arc } EC) = CD = R\sin(\angle COE), \qquad (27)$$

$$\text{koti}\,(\text{arc } EC) = OD = R\cos(\angle COE), \qquad (28)$$

$$\text{sara}\,(\text{arc } EC) = ED = R\,\mathrm{vers}(\angle COE) = R - R\cos(\angle COE). \qquad (29)$$

For computing standard Rsine-tables (pathita-jya), the circumference of a circle
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5.1. Computation of Rsines 


Once the value of the radius R is fixed (in units of minutes, seconds etc.), the 24 Rsines can be computed (in the same units) using standard relations of jyotpatti (trigonometry). For instance, Varahamihira has given the following Rsine values and relations in his Pancasiddhantika (c. 505):22

$$R\sin(30^\circ) = \frac{R}{2}, \qquad (30a)$$

$$R\sin(45^\circ) = \frac{R}{\sqrt{2}}, \qquad (30b)$$

$$R\sin(60^\circ) = \frac{\sqrt{3}}{2}\,R, \qquad (30c)$$

$$R\sin(90^\circ) = R, \qquad (30d)$$

$$R\sin(A) = R\cos(90^\circ - A), \qquad (31)$$

$$R\sin^2(A) + R\cos^2(A) = R^2, \qquad (32)$$

$$R\sin\left(\frac{A}{2}\right) = \left(\frac{1}{2}\right)\left[R\sin^2(A) + R\,\mathrm{vers}^2(A)\right]^{\frac{1}{2}} = \left(\frac{R}{2}\right)^{\frac{1}{2}}\left[R - R\cos A\right]^{\frac{1}{2}}. \qquad (33)$$

Here $R\sin^2(A)$ stands for $(R\sin A)^2$, and similarly for the Rcosine and Rversine. The above Rsine values (30) and relations (31)-(33) can be derived using the bhuja-koti-karna-nyaya (Pythagoras theorem) and trairasika (rule of three for similar triangles), as is done for instance in the Vasana-bhasya of Prthudakasvamin (c. 860) on Brahmasphutasiddhanta (c. 628) of Brahmagupta. Equations (30)-(33) can be used to compute all 24 tabular Rsines.
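That the four known values together with the complement and half-angle relations suffice for all 24 tabular Rsines can be seen by letting a program apply the relations until nothing new is produced. This is a sketch under stated assumptions: arcs are indexed by k (arc = k times 3°45'), the radius is taken as 21600/(2 pi) minutes, and the function name is hypothetical.

```python
import math

def rsine_table(R=21600 / (2 * math.pi)):
    # Known values: index 8 = 30 deg, 12 = 45 deg, 16 = 60 deg, 24 = 90 deg.
    B = {0: 0.0, 8: R / 2, 12: R / math.sqrt(2), 16: math.sqrt(3) / 2 * R, 24: R}
    changed = True
    while changed:
        changed = False
        for k in list(B):
            # Half-angle relation (33); Rcos(A) is Rsin(90 deg - A) by (31).
            if k > 0 and k % 2 == 0 and (24 - k) in B and k // 2 not in B:
                B[k // 2] = math.sqrt(R / 2 * (R - B[24 - k]))
                changed = True
            # Complementary arc via (31) and (32).
            if 24 - k not in B:
                B[24 - k] = math.sqrt(R * R - B[k] ** 2)
                changed = True
    return [B[k] for k in range(1, 25)]
```

The loop terminates with all 24 entries filled, which illustrates the closing remark of the paragraph above.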


5.2. Aryabhata's computation of Rsine-differences
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wwarmgmsaunmi«pe4 uud Riata 
" qunrasuratspeqne-dris Srarfor i 


The first Rsine divided by itself and then diminished by the quotient will give 
the second Rsine-difference. The same first Rsine, diminished by the quotients 
obtained by dividing each of the preceding Rsines by the first Rsine, gives the 
remaining Rsine-differences. 


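Let $B_1 = R\sin(225')$, $B_2 = R\sin(450')$, ..., $B_{24} = R\sin(90^\circ)$ be the twenty-four Rsines, and let $\Delta_1 = B_1$, $\Delta_2 = B_2 - B_1$, ..., $\Delta_k = B_k - B_{k-1}$, ... be the Rsine-differences. Then, the above rule may be expressed as24

$$\Delta_2 = B_1 - \frac{B_1}{B_1}, \qquad (34)$$

$$\Delta_{k+1} = B_1 - \frac{B_1 + B_2 + \ldots + B_k}{B_1} \quad (k = 1, 2, \ldots, 23). \qquad (35)$$

This second relation is also sometimes expressed in the equivalent form

$$\Delta_{k+1} = \Delta_k - \frac{\Delta_1 + \Delta_2 + \ldots + \Delta_k}{B_1} \quad (k = 1, 2, \ldots, 23). \qquad (36)$$

From the above it follows that

$$\Delta_{k+1} - \Delta_k = -\frac{B_k}{B_1} \quad (k = 1, 2, \ldots, 23). \qquad (37)$$

Since Aryabhata also takes

$$\Delta_1 = B_1 = R\sin(225') \approx 225', \qquad (38)$$

the above relations reduce to

$$\Delta_{k+1} - \Delta_k = -\frac{B_k}{225'} \quad (k = 1, 2, \ldots, 23). \qquad (39)$$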
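The rule for the Rsine-differences can be run mechanically. The sketch below starts from a first Rsine of 225 and rounds each difference to the nearest minute; that rounding convention is an assumption made here, and later entries are sensitive to how fractions are handled at each step, so only the opening entries are checked.

```python
def aryabhata_differences(first=225, count=24):
    diffs = [first]      # Rsine-differences, starting with the first Rsine
    rsines = [first]     # running Rsines (sums of the differences)
    for _ in range(count - 1):
        # Next difference = first Rsine minus (sum of preceding Rsines) / first Rsine.
        nxt = round(first - sum(rsines) / first)
        diffs.append(nxt)
        rsines.append(rsines[-1] + nxt)
    return diffs
```

The first six values produced are 225, 224, 222, 219, 215 and 210.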


5.3. Derivation of the Aryabhata relation for the second-order Rsine-differences
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the first and second-order Rsine-differences as given in Yuktibhasa ?? later in 
Section 16. Here we shall only summarize the argument. 

In Figure 3, the arcs $EC_j$ and $EC_{j+1}$ are successive multiples of 225'. The Rsine and Rcosine of the arcs $EC_j$ and $EC_{j+1}$ are given by

$$B_j = C_jP_j,\quad B_{j+1} = C_{j+1}P_{j+1} \qquad (40)$$

and

$$K_j = C_jT_j,\quad K_{j+1} = C_{j+1}T_{j+1} \qquad (41)$$

respectively. Let $M_{j+1}$ and $M_j$ be the mid-points of the arcs $C_jC_{j+1}$, $C_{j-1}C_j$ and the Rsine and Rcosine of the arcs $EM_j$ and $EM_{j+1}$ be denoted respectively by $B_{j-\frac{1}{2}}$, $B_{j+\frac{1}{2}}$, $K_{j-\frac{1}{2}}$, $K_{j+\frac{1}{2}}$.

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$ Let the chord of the arc C;Cj+1, be denoted by a and let R be the radius. Then 
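a simple argument based on trairasika (similar triangles) leads to the relations:26

$$B_{j+1} - B_j = \left(\frac{a}{R}\right) K_{j+\frac{1}{2}}, \qquad (42)$$

$$K_{j-\frac{1}{2}} - K_{j+\frac{1}{2}} = \left(\frac{a}{R}\right) B_j. \qquad (43)$$

Thus we get

$$\Delta_{j+1} - \Delta_j = (B_{j+1} - B_j) - (B_j - B_{j-1}) = -\left(\frac{a}{R}\right)^2 B_j. \qquad (44)$$

We can also express this relation in the form

$$\Delta_{j+1} - \Delta_j = -\frac{B_j(\Delta_1 - \Delta_2)}{B_1}. \qquad (45)$$

The above relations are exact. Aryabhata's relation (39) corresponds to the approximations $B_1 \approx 225'$ and $\Delta_1 - \Delta_2 \approx 1'$, so that

$$\left(\frac{a}{R}\right)^2 = \frac{\Delta_1 - \Delta_2}{B_1} \approx \frac{1}{225'}. \qquad (46)$$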
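The exactness of the second-difference relation, and the size of the constant that 1/225 approximates, can be checked numerically. In this sketch R is taken as 21600/(2 pi) minutes and the Rsines are computed with the modern sine function; both choices, and all names, are assumptions made for illustration.

```python
import math

R = 21600 / (2 * math.pi)        # radius in minutes of arc
h = math.radians(225 / 60)       # the 225' arc-bit, in radians
a = 2 * R * math.sin(h / 2)      # full chord of one arc-bit

def B(j):
    return R * math.sin(j * h)

def delta(j):
    return B(j) - B(j - 1)

def second_difference_residual(j):
    # Should vanish if Delta_{j+1} - Delta_j = -(a/R)^2 * B_j holds exactly.
    return (delta(j + 1) - delta(j)) + (a / R) ** 2 * B(j)

# Exact divisor that replaces 225 in the approximate relation.
exact_divisor = (R / a) ** 2
```

The residual is zero to rounding error for every j from 1 to 23, and `exact_divisor` comes out near 233.53, which shows how rough the divisor 225 is.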
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This is further refined by Saükara Variyar in his commentary Laghu-vivrti in the 


form:28

$$\left(\frac{a}{R}\right)^2 = \frac{\Delta_1 - \Delta_2}{B_1} \approx \frac{1}{233'\,32''}. \qquad (48)$$


Since $a = 2R\sin(112'\,30'')$, we find that the above relation is correct up to seconds.

Commenting on Aryabhata's method of computing Rsines, Delambre had remarked:29


The method is curious: it indicates a method of calculating the table of sines by means of their second-differences... This differential process has not up to now been employed except by Briggs, who himself did not know that the constant factor was the square of the chord ΔA (= 3°45') or of the interval, and who could not obtain it except by comparing the second differences obtained in a different manner. The Indians also have probably done the same; they obtained the method of differences only from a table calculated previously by a geometric process. Here then is a method which the Indians possessed and which is found neither amongst the Greeks, nor amongst the Arabs.


5.4. The Rsine-table of Aryabhata 


In the Gitika-pada of Aryabhatiya, Aryabhata has given a table of Rsine-differences:30


[Sanskrit verse]


225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164, 154, 143, 131, 119, 106, 93, 79, 65, 51, 37, 22, and 7: these are the Rsine-differences [at intervals of 225' of arc] in terms of the minutes of arc.


The above values follow directly from Aryabhata's relation (39) for the second-order Rsine-differences. To start with, $\Delta_1 = B_1 = R\sin(225') \approx 225'$. Then we get $\Delta_2 = B_1 - \frac{B_1}{B_1} = 224'$ and so on.

The Rsine-table of Aryabhata31 (see Table 1), obtained in this way, is accurate to minutes. In this table, we also give the Rsine values given by Govindasvamin (c. 825) in his commentary on Mahabhaskariya and by Madhava


28 Ibid., comm. on verse 2.4.
29 Delambre, Histoire de l'A...
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(c. 1340-1420) as recorded in the Aryabhatiya-bhasya (c. 1502) of Nilakantha 

Somayaji. Though Govindasvamin gives the Rsine values up to the thirds, his 

values are accurate only up to the seconds; those of Madhava are accurate up to 
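the thirds.


Arc-length | Aryabhata (c. 499) | Govindasvamin (c. 825) | Madhava (c. 1375)
3°45' | 225' | 224' 50'' 23''' | 224' 50'' 22'''
7°30' | 449' | 448' 42'' 53''' | 448' 42'' 58'''
11°15' | 671' | 670' 40'' 11''' | 670' 40'' 16'''
15°00' | 890' | 889' 45'' 08''' | 889' 45'' 15'''
18°45' | 1105' | 1105' 01'' 30''' | 1105' 01'' 39'''
22°30' | 1315' | 1315' 33'' 56''' | 1315' 34'' 07'''
26°15' | 1520' | 1520' 28'' ... | 1520' 28'' 35'''
30°00' | 1719' | 1718' 52'' 10''' | 1718' 52'' 24'''
33°45' | 1910' | 1909' 54'' 19''' | 1909' 54'' 35'''
37°30' | 2093' | 2092' 45'' 46''' | 2092' 46'' 03'''
41°15' | 2267' | 2266' 38'' 44''' | 2266' 39'' 50'''
45°00' | 2431' | 2430' 50'' 54''' | 2430' 51'' 15'''
48°45' | 2585' | 2584' 37'' 43''' | 2584' 38'' 06'''
52°30' | 2728' | 2727' 20'' 29''' | 2727' 20'' 52'''
56°15' | 2859' | 2858' 22'' 31''' | 2858' 22'' 55'''
60°00' | 2978' | 2977' 10'' 09''' | 2977' 10'' 34'''
63°45' | 3084' | 3083' 12'' 51''' | 3083' 13'' 17'''
67°30' | 3177' | 3176' 03'' 23''' | 3176' 03'' 50'''
71°15' | 3256' | 3255' 17'' 54''' | 3255' 18'' 22'''
75°00' | 3321' | 3320' 36'' 02''' | 3320' 36'' 30'''
78°45' | 3372' | 3371' 41'' 01''' | 3371' 41'' 29'''
82°30' | 3409' | 3408' 19'' 42''' | 3408' 20'' 11'''
86°15' | 3431' | 3430' 22'' 42''' | 3430' 23'' 11'''
90°00' | 3438' | 3437' 44'' 19''' | 3437' 44'' 48'''

Table 1: Rsine-tables of Aryabhata, Govindasvamin and Madhava.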


5.5. Brahmagupt: 
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work, which is in the form of a manual (karana) for astronomical calculations, 
Brahmagupta uses a simpler Rsine-table which gives Rsines only at intervals of 
15° or 900':32


[Sanskrit verse]


Multiply the residual arc after division by 900' by half the difference of the tabular Rsine-difference passed over (gata-khanda) and to be passed over (bhogya-khanda) and divide by 900'. The result is to be added to or subtracted from half the sum of the same tabular Rsine-differences according as this [half-sum] is less than or greater than the tabular Rsine-difference to be passed over. What results is the true Rsine-difference to be passed over.


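Let h be the basic unit of arc in terms of which the Rsine-table is constructed, which happens to be 225' in the case of Aryabhatiya, and 900' in the case of Khandakhadyaka. Let the arc for which the Rsine is to be found be given by

$$s = jh + \varepsilon \quad \text{for some } j = 0, 1, \ldots \qquad (49)$$

Now $R\sin(jh) = B_j$ are the tabulated Rsines. Then, a simple interpolation (trairasika) would yield

$$R\sin(jh + \varepsilon) = B_j + \left(\frac{\varepsilon}{h}\right)(B_{j+1} - B_j) = R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\Delta_{j+1}. \qquad (50)$$

Instead of the above simple interpolation, Brahmagupta prescribes

$$R\sin(jh + \varepsilon) = B_j + \left(\frac{\varepsilon}{h}\right)\left[\frac{1}{2}(\Delta_j + \Delta_{j+1}) \pm \left(\frac{\varepsilon}{h}\right)\frac{|\Delta_j - \Delta_{j+1}|}{2}\right]. \qquad (51)$$

Here, the sign is chosen to be positive if $\Delta_j < \Delta_{j+1}$, and negative if $\Delta_j > \Delta_{j+1}$ (as in the case of Rsine). So Brahmagupta's rule is actually the second-order interpolation formula

$$R\sin(jh + \varepsilon) = R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\frac{(\Delta_j + \Delta_{j+1})}{2} - \left(\frac{\varepsilon}{h}\right)^2\frac{(\Delta_j - \Delta_{j+1})}{2}. \qquad (52)$$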
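The gain from the second-order rule over simple proportion can be seen on a coarse table at 900' intervals. This is a sketch under assumptions: the table is generated with the modern sine function for R = 3438 rather than copied from the Khandakhadyaka, and in the first interval the difference "passed over" is set equal to the one "to be passed over", so that the rule reduces to simple proportion there.

```python
import math

R = 3438.0
H = 900.0   # tabular interval in minutes of arc
TABLE = [R * math.sin(math.radians(15 * j)) for j in range(7)]   # B_0 .. B_6

def _split(arc):
    j = min(int(arc // H), 5)
    return j, arc - j * H

def rsine_linear(arc):
    j, eps = _split(arc)
    return TABLE[j] + (eps / H) * (TABLE[j + 1] - TABLE[j])

def rsine_brahmagupta(arc):
    j, eps = _split(arc)
    d_next = TABLE[j + 1] - TABLE[j]                       # bhogya-khanda
    d_prev = TABLE[j] - TABLE[j - 1] if j > 0 else d_next  # gata-khanda (assumed for j = 0)
    t = eps / H
    # Half-sum of the differences, corrected by t times half their difference.
    true_diff = (d_prev + d_next) / 2 - t * (d_prev - d_next) / 2
    return TABLE[j] + t * true_diff
```

At an arc of 1200' (20°), for example, simple proportion is off by nearly ten minutes while the second-order rule is off by about three.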
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6. Instantaneous velocity of a planet (£atkalzka-gat:) 
6.1. True daily motion of a planet 


In Indian astronomy, the motion of a planet is computed by making use of two corrections: the manda-samskara, which essentially corresponds to the equation of centre, and the sighra-samskara, which corresponds to the conversion of the heliocentric longitudes to geocentric longitudes. The manda correction for planets is given in terms of an epicycle of variable radius r, which varies in such a way that

$$\frac{r}{K} = \frac{r_0}{R}, \qquad (53)$$

where K is the karna (hypotenuse) or the (variable) distance of the planet from the centre of the concentric and $r_0$ is the tabulated (or mean) radius of the epicycle in the measure of the concentric circle of radius R.
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longitude of the ucca, then the correction (manda-phala) A p is given by 


$$R\sin(\Delta\mu) = \left(\frac{r}{K}\right) R\sin(M - \alpha) = \left(\frac{r_0}{R}\right) R\sin(M - \alpha). \qquad (54)$$

For small r, the left hand side is usually approximated by the arc itself. The manda-correction is to be applied to the mean longitude M, to obtain the true or manda-corrected longitude $\mu$ given by

$$\mu = M - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\sin(M - \alpha). \qquad (55)$$

If $n_m$ and $n_u$ are the mean daily motions of the planet and the ucca, then the true longitude on the next day is given by

$$\mu + n = (M + n_m) - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\sin(M + n_m - \alpha - n_u). \qquad (56)$$

The true daily motion is thus given by

$$n = n_m - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right)\left[R\sin\{(M - \alpha) + (n_m - n_u)\} - R\sin(M - \alpha)\right]. \qquad (57)$$

The second term in the above is the correction to mean daily motion (gati-phala). An expression for this was given by Bhaskara I (c. 629) in Mahabhaskariya, where he makes use of the approximation:33

$$R\sin\{(M - \alpha) + (n_m - n_u)\} - R\sin(M - \alpha) \approx \frac{(n_m - n_u)}{225} \times \text{Rsine-difference at } (M - \alpha). \qquad (58)$$

In the above approximation, $(n_m - n_u)$ is multiplied by the tabular Rsine-difference at the 225' arc-bit in which (the tip of the arc) $(M - \alpha)$ is located. Therefore, under this approximation, as long as the anomaly (kendra), $(M - \alpha)$, is in the same multiple of 225', there will be no change in the gati-phala or the correction to the mean velocity. This defect was noticed by Bhaskara also in his later work Laghubhaskariya:34


[Sanskrit verse]


33 Mahabhaskariya of Bhaskara I, Ed. by K. S. Shukla, Lucknow 1960, verse 4.14, p. 120.
34 Laghubhaskariya of Bhaskara I, Ed. by K. S. Shukla, Lucknow 1963, verses 2.14-5, p. 6.
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yaoa na Aaea i 
Whilst the Sun or the Moon moves in the [same] element of arc, there is no change 


in the rate of motion (bhukti), because the Rsine-difference does not increase or 
decrease; viewed thus, the rate of motion [as given above] is defective. 


The correct formula for the true daily motion of a planet, employing the Rcosine as the 'rate of change' of Rsine, seems to have been first given by Munjala (c. 932) in his short manual Laghumanasa35 and also by Aryabhata II (c. 950) in his Maha-siddhanta:36


[Sanskrit verse]


The kotiphala multiplied by the [mean] daily motion and divided by the radius gives the minutes of the correction [to the rate of the motion].


This gives the true daily motion in the form

$$n = n_m - (n_m - n_u)\left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\cos(M - \alpha). \qquad (59)$$


6.2. The notion of instantaneous velocity (tatkalikagati) according to Bhaskaracarya II


Bhaskaracarya II (c. 1150) in his Siddhantasiromani clearly distinguishes the true daily motion from the instantaneous rate of motion. And he gives the Rcosine correction to the mean rate of motion as the instantaneous rate of motion. He further emphasizes the fact that the velocity is changing every instant and this is particularly important in the case of the Moon because of its rapid motion.37


[Sanskrit verse]

The true daily motion of a planet is the difference between the true planets on successive days. And it is accurate (sphuta) over that period. The kotiphala (Rcosine of anomaly) is multiplied by the rate of motion of the manda-anomaly (mrdu-kendra-bhukti) and divided by the radius. The result added or subtracted from the mean rate of motion of the planet, depending on whether the anomaly is in Karkyadi or Mrgadi, gives the true instantaneous rate of motion (tatkaliki manda-sphutagati) of the planet.


35 Laghumanasa of Munjala, Ed. by K. S. Shukla, New Delhi 1990, verse 3.4, p. 125.

36 Mahasiddhanta of Aryabhata II, Ed. by Sudhakara Dvivedi, Varanasi 1910, verse 3.15, p. 58.

37 Siddhantasiromani of Bhaskaracarya, Ed. by Muralidhara Chaturvedi, Varanasi 1981, verses 2.36-8, p. 119.
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In the case of the Moon, the ending moment of a tithi?® which is about to end or 
the beginning time of a tithi which is about to begin, are to be computed with the
instantaneous rate of motion at the given instant of time. The beginning moment 
of a tithi which is far away can be calculated with the earlier [daily] rate of 
motion. This is because Moon's rate of motion is large and varies from moment 
to moment. 


Here, Bhaskara explains the distinction between the true daily rate of motion 
and the true instantaneous rate of motion. The former is the difference between 
the true longitudes on successive days and it is accurate as the rate of motion, on 
the average, for the entire period. The true instantaneous rate of motion is to be 
calculated from the Rcosine of the anomaly (kotiphala) for each relevant moment. 

Thus if $\omega_m$ and $\omega_u$ are the rates of motion of the mean planet and the ucca, then $\omega_m - \omega_u$ is the rate of motion of the anomaly, and the true instantaneous rate of motion of the planet at any instant is given by Bhaskara to be

$$\omega = \omega_m - (\omega_m - \omega_u)\left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\cos(M - \alpha), \qquad (60)$$

where $(M - \alpha)$ is the anomaly of the planet at that instant.
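The distinction between the day-to-day difference of true longitudes and the instantaneous rate can be made concrete with a small numerical sketch. It assumes the manda-corrected longitude in the small-epicycle form, mean longitude minus (r0/R) times the sine of the anomaly, with angles in radians; the parameter values used in the check are purely illustrative and not taken from any text.

```python
import math

def true_longitude(M, alpha, ratio):
    # Manda-corrected longitude; ratio stands for r0/R, angles in radians.
    return M - ratio * math.sin(M - alpha)

def daily_motion(M, alpha, n_m, n_u, ratio):
    # Difference between the true longitudes of tomorrow and today.
    tomorrow = true_longitude(M + n_m, alpha + n_u, ratio)
    return tomorrow - true_longitude(M, alpha, ratio)

def instantaneous_rate(M, alpha, w_m, w_u, ratio):
    # Mean rate corrected by the cosine of the anomaly.
    return w_m - (w_m - w_u) * ratio * math.cos(M - alpha)
```

For a fast-moving body the two quantities differ noticeably, while the instantaneous rate agrees with the derivative of the true longitude; this is the point of the distinction drawn in the text.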


Bhaskara explains the idea of the instantaneous velocity even more clearly in his Vasana:39


[Sanskrit text]


38 Tithi is the time taken by the Moon to lead the Sun exactly by 12° in longitude.
39 Siddhantasiromani, cited above, Vasana on 2.36-38, pp. 119-20.
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ay weeds STR DADETSUIRGDÉROANO AA: | 

Se Tere ait ngo ore TRI 
IATA SAR FTAA: | SATE aga. 
ws qd« UT ToT RNR: (aas) HAT aA aad 




The true daily velocity is the difference in minutes etc., between the true planets 
of today and tomorrow, either at the time of sunrise, or mid-day or sunset. If to- 
morrow's longitude is smaller than that of today, then we should understand the 
motion to be retrograde. It is said “over that period". This only means that, dur- 
ing that intervening period, the planet is to move with this rate [on the average]. 
This is only a rough or approximate rate of motion. Now we shall discuss the 
instantaneous rate of motion... In this way, the manda-corrected true instanta- 
neous rate of motion (tatkalikt manda-parisphutagati) is calculated. In the 
case of Moon, this instantaneous rate of motion is especially useful...Because of 
its largeness, the rate of motion of Moon is not the same every instant. Hence, in 
the case of Moon, the special [instantaneous] rate of motion is stated. 


Then, the justification for the correction to the rate of motion (gati-phala-vasana). The rate of motion of the anomaly is the difference in the anomalies of today and tomorrow. That should be multiplied by the [current] Rsine-difference used in the computation of Rsines and divided by 225. Now, the following rule of three to obtain the instantaneous Rsine-difference: If the first Rsine-difference 225 results when the Rcosine is equal to the radius, then how much is it for the given Rcosine. In this way, the Rcosine is to be multiplied by 225 and divided by the radius. The result is the instantaneous Rsine-difference and that should be multiplied by the rate of motion in the anomaly and divided by 225.


Thus, Bhaskara is here conceiving also of an instantaneous Rsine-difference, though his derivation of the instantaneous velocity is somewhat obscure. These ideas are more clearly set forth in the Aryabhatiya-bhasya (c. 1502) of Nilakantha Somayaji and other works of the Kerala School.


6.3. The sighra correction to the velocity and the condition for retrograde motion


Bhaskara then goes on to derive the correct expression for the true rate of motion as corrected by the sighra-correction. In the language of modern astronomy, the sighra-correction converts the heliocentric longitude of the planets to the geocentric longitudes. Here also, the Indian astronomers employ an epicycle, but with a fixed radius, unlike in the case of the manda-correction.

If $\mu$ is the manda-corrected (manda-sphuta) longitude of the planet, $\zeta$ is the longitude of the sighrocca, and $r_s$ the radius of the sighra-epicycle, then the correction (sighra-phala) $\Delta\sigma$ is given by

$$R\sin(\Delta\sigma) = \left(\frac{r_s}{K}\right) R\sin(\mu - \zeta), \qquad (61)$$

where $(\mu - \zeta)$ is the sighrakendra and K is the hypotenuse (sighrakarna) given by

$$K^2 = R^2 + r_s^2 - 2Rr_s\cos(\mu - \zeta). \qquad (62)$$

The calculation of the sighra-correction to the velocity is indeed much more difficult, as the denominator in (61), which is the hypotenuse and depends on the anomaly, also varies with time in a complex way. This has been noted by Bhaskara, who was able to obtain the correct form of the sighra-correction to the velocity (sighra-gati-phala) in an ingenious way.40


[Sanskrit verse; illegible in this copy.]

The Rsine of ninety degrees, less the degrees of sighra-correction for the longitude (sighra-phala), should be multiplied by the rate of motion of the sighra-anomaly (drak-kendra-bhukti) and divided by the hypotenuse (sighra-karna). This, subtracted from the rate of motion of the sighrocca, gives the true velocity of the planet. If this is negative, the planet's motion is retrograde.


40 Siddhantasiromani, cited above, verse 2.39, p. 121.

If $\omega$ is the rate of motion of the manda-corrected planet and $\omega_s$ is the rate of motion of the sighrocca, then the rate of motion of the sighra-anomaly is $(\omega - \omega_s)$, and the true velocity of the planet $\omega_t$ is given by

\[ \omega_t = \omega_s - \left[ \frac{(\omega_s - \omega)\, R\cos(\Delta\sigma)}{K} \right]. \tag{63} \]
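As a rough numerical illustration of (61)-(63), the following Python sketch evaluates the formula for the true velocity and compares it with a finite-difference derivative of the corrected longitude. It rests on assumptions that are not in the text: the true longitude is taken to be $\mu + \Delta\sigma$ (the sign convention that goes with (61) and (62) as written), $r_s < R$, and the radii and rates are hypothetical illustrative values. The function names are ours.

```python
import math

def sighra_karna(R, r_s, mu, zeta):
    """Hypotenuse K from equation (62)."""
    return math.sqrt(R * R + r_s * r_s - 2.0 * R * r_s * math.cos(mu - zeta))

def sighra_phala(R, r_s, mu, zeta):
    """Correction from equation (61): sin(ds) = (r_s / K) * sin(mu - zeta)."""
    K = sighra_karna(R, r_s, mu, zeta)
    return math.asin(r_s * math.sin(mu - zeta) / K)

def true_velocity(R, r_s, mu, zeta, omega, omega_s):
    """True rate of motion omega_t from equation (63)."""
    K = sighra_karna(R, r_s, mu, zeta)
    ds = sighra_phala(R, r_s, mu, zeta)
    return omega_s - (omega_s - omega) * R * math.cos(ds) / K

def numerical_velocity(R, r_s, mu, zeta, omega, omega_s, h=1e-5):
    """Central difference of the ASSUMED true longitude mu + ds."""
    def longitude(t):
        m = mu + omega * t        # manda-corrected longitude at time t
        z = zeta + omega_s * t    # longitude of the sighrocca at time t
        return m + sighra_phala(R, r_s, m, z)
    return (longitude(h) - longitude(-h)) / (2.0 * h)

if __name__ == "__main__":
    R, r_s = 1.0, 0.65            # hypothetical radii, with r_s < R
    omega, omega_s = 0.5, 1.0     # hypothetical rates of motion
    for deg in (0, 40, 120, 180, 250):
        mu = math.radians(deg)
        w_formula = true_velocity(R, r_s, mu, 0.0, omega, omega_s)
        w_numeric = numerical_velocity(R, r_s, mu, 0.0, omega, omega_s)
        label = "retrograde" if w_formula < 0 else "direct"
        print(f"anomaly {deg:3d} deg: formula {w_formula:+.6f}, "
              f"numeric {w_numeric:+.6f} ({label})")
```

With these sample values the formula turns negative near zero anomaly, which is the retrograde condition stated in the verse.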


The details of the ingenious argument given by Bhaskara for deriving the correct form (63) of the sighra-correction to the velocity have been outlined by D. Arkasomayaji in his translation of Siddhantasiromani.⁴¹

Since Bhaskara's derivation is somewhat long-winded, we shall present here a modern derivation of the result, just to demonstrate that the expression given by Bhaskara is indeed exact.


[...] of the orbits of [...] respectively and [...]

If $v_t$ is the linear velocity of the planet as seen from the Earth, then the angular velocity is given by

\[ \omega_t = \frac{v_t}{K}. \tag{64} \]

The magnitude of $v_t$ in terms of $v$ and $v_s$ (for the situation depicted in Figure 5a) is

\[ v_t = v\cos\Delta\sigma + v_s\cos\theta. \tag{65} \]

Also, from the triangle SEP, the distance of the planet from the Earth, known as karna and denoted K in the figure, may be expressed as

\[ K = R\cos\Delta\sigma + r\cos\theta, \quad \text{or} \quad \cos\theta = \frac{K - R\cos\Delta\sigma}{r}. \tag{66} \]

Using (66) in (65) we have

\[ v_t = v\cos\Delta\sigma + \frac{v_s}{r}\,(K - R\cos\Delta\sigma) = \frac{v_s K}{r} + \left(v - v_s\frac{R}{r}\right)\cos\Delta\sigma, \]

or

\[ \frac{v_t}{K} = \frac{v_s}{r} + \left(v - v_s\frac{R}{r}\right)\frac{\cos\Delta\sigma}{K}. \tag{67} \]

Making use of (64) and the fact that $v = R\omega$ and $v_s = r\omega_s$, the above equation reduces to

\[ \omega_t = \omega_s - \left[ \frac{(\omega_s - \omega)\, R\cos\Delta\sigma}{K} \right], \]

which is the same as the expression given by Bhaskara (63).
Bhaskara in his Vasana⁴² explains as to why in the [...] procedure for finding the rate of motion of [...] one used in the manda process:

[Sanskrit text; illegible in this copy.]


Here is the justification. The sighra-correction to the rate of motion is the difference between the sighra-phalas of today and tomorrow. If that is derived in the same way as the manda-correction to the rate of motion, the result will be incorrect even if it were to be divided by the hypotenuse (sighra-karna)... The difference is not just due to the change in the anomaly [which is the argument of the Rsine] but also otherwise... The result of dividing by today's hypotenuse is different from that of dividing by that of tomorrow. Even if the hypotenuses turn out to differ by a small amount, the quantities they divide are large and thus a large difference could result. Hence, this way of approach [which was adopted in the case of manda-correction to the rate of the motion] has been forsaken and another has been devised by the great intellects. That is as follows...


6.4. The equation of centre is extremum when the velocity correction vanishes


Later, in the Goladhyaya of Siddhantasiromani, Bhaskara considers the situation when the correction to the velocity (gati-phala) vanishes:⁴³

[Sanskrit verse; illegible in this copy.]

Where the [North-South] line perpendicular to the [East-West] line of apsides 
through the centre of the concentric meets the eccentric, there the mean velocity 
itself is true and the equation of centre is extremum. 


In his Vasana, Bhaskara explains this correlation between the vanishing of the velocity correction and the extrema of the correction to the planetary longitude:⁴⁴

[Sanskrit text; illegible in this copy.]


43 Siddhantasiromani, cited above, Goladhyaya 4.39, p. 393.
44 Ibid., Vasana on Goladhyaya 4.39.

The mean rate of motion itself is exact at the points where the line perpendicular [to the line of apsides], at the middle of the concentric circle, meets the eccentric circle. Because, there is no correction to the rate of motion [at those points]. Also, because there the equation of centre [or correction to the planetary longitudes] is extreme. Wherever the equation of centre is maximum, there the correction to the velocity should be absent. Because, the rate of motion is the difference between the planetary longitudes of today and tomorrow. The correction to the velocity is the difference between the equations of centre. The place where the correction to the velocity vanishes, there is a change over from positive to negative. And, what has been stated by Lalla, "the mean rate of motion is itself true when the planet is on the intersection of the two circles [concentric and eccentric]", that is incorrect. The planet does not have maximum equation of centre at the confluence of the two circles.
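Bhaskara's statement, and his criticism of Lalla, can be checked numerically in a simplified setting. The sketch below assumes a plain eccentric-circle model (a circle of radius R whose centre is displaced by e from the centre of the concentric along the line of apsides); this is our simplification and not Bhaskara's full manda procedure, and the values of R and e are hypothetical.

```python
import math

def equation_of_centre(M, R, e):
    """True anomaly minus mean anomaly M for a simple eccentric circle.

    The planet sits at (e + R cos M, R sin M) as seen from the centre of
    the concentric, with the line of apsides along the x-axis.
    """
    x = e + R * math.cos(M)
    y = R * math.sin(M)
    return math.atan2(y, x) - M

def true_to_mean_rate(M, R, e):
    """d(true anomaly)/d(mean anomaly); equals 1 when the correction vanishes."""
    x = e + R * math.cos(M)
    y = R * math.sin(M)
    return (R * R + e * R * math.cos(M)) / (x * x + y * y)

def anomaly_of_largest_correction(R, e, steps=100000):
    """Grid search over 0 < M < pi for the largest |equation of centre|."""
    grid = (math.pi * i / steps for i in range(1, steps))
    return max(grid, key=lambda M: abs(equation_of_centre(M, R, e)))

if __name__ == "__main__":
    R, e = 3438.0, 300.0                   # hypothetical values, in minutes
    M_perp = math.acos(-e / R)             # planet on the perpendicular (x = 0)
    M_lalla = math.acos(-e / (2.0 * R))    # planet where the two circles cross
    M_best = anomaly_of_largest_correction(R, e)
    print("largest correction found at M =", math.degrees(M_best))
    print("perpendicular point at M      =", math.degrees(M_perp))
    print("rate ratio at perpendicular   =", true_to_mean_rate(M_perp, R, e))
    print("rate ratio at circle crossing =", true_to_mean_rate(M_lalla, R, e))
```

In this model the largest equation of centre falls on the perpendicular through the centre of the concentric, where the true and mean rates coincide, and not at the crossing of the two circles.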


FIGURE 5b. Equation of centre is extremum where the correction to velocity vanishes.


[...] vanishes, as it changes [...] to state (as Lalla did in his [...] velocity is zero at the [...]

[...] [Sridhara] (c. 750) seems to have given the correct expression for the volume of a sphere (Trisatika 56), though his estimate of $\pi$ is fairly off the mark. Bhaskaracarya (c. 1150) has given the correct relation between the diameter, the surface area and the volume of a sphere in his Lilavati:⁴⁵

[Sanskrit verse; illegible in this copy.]


In a circle, the circumference multiplied by one-fourth the diameter is the area, which, multiplied by four, is its surface area going around it like a net around a ball. This [surface area] multiplied by the diameter and divided by six is the volume of the sphere.


The surface area and volume of a sphere have been discussed in greater detail in the Siddhantasiromani (Goladhyaya 2.53-61), where Bhaskara has also presented the upapatti or justification for the results in his commentary Vasana. As regards the surface area of the sphere, Bhaskara argues as follows:⁴⁶

[Sanskrit text; illegible in this copy.]


[...] that there will be twenty-four circles. These circles will have as their radii Rsines starting from 225'. The measure [circumference] of the circle will be in proportion to these radii. Here, the last circle has a circumference 21,600' and its radius is 3,438'. The Rsines multiplied by 21,600 and divided by the radius [3,438] will give the [circumference] measure of the circles. Between any two circles, there is an annular region and there are twenty-four of them. If more [than 24] Rsines are used, then there will be as many regions. In each figure [if it is cut and spread across as a trapezium] the larger lower circle may be taken as the base and the smaller upper circle as the face and 225' as the altitude and the area calculated by the usual rule: [Area is] altitude multiplied by half the sum of the base and face. The sum of all these areas is the area of half the sphere. Twice that will be the surface area of the entire sphere. That will always be equal to the product of the diameter and the circumference.


Here Bhaskara is taking the circumference to be $C = 21{,}600'$, and the corresponding radius is approximated as $R \approx 3{,}438'$. As shown in Figure 6, circles are drawn parallel to the equator of the sphere, each separated in latitude by 225'. This divides the northern hemisphere into 24 strips, each of which can be cut and spread across as a trapezium. If we denote the 24 tabulated Rsines by $B_1, B_2, \ldots, B_{24}$, then the area $A_j$ of the j-th trapezium will be

\[ A_j = \left(\frac{C}{R}\right) \left[\frac{B_j + B_{j+1}}{2}\right] (225). \tag{68} \]

Therefore, the surface area S of the sphere is estimated to be

\[ S = 2\left(\frac{C}{R}\right) \left[ B_1 + B_2 + \cdots + B_{23} + \left(\frac{B_{24}}{2}\right) \right] (225). \tag{69} \]


Now, Bhaskara states that the right hand side of the above equation [...] 2CR. This can be checked by using Bhaskara's Rsine-table. B[...] has done the summation of the Rsines in his Vasana on the [...] the surface of the sphere into lunes. In that con[text] [...]
FIGURE 6. Surface area of a sphere.


Thus, according to Bhaskara's Rsine table,

\[ \left[ B_1 + B_2 + \cdots + B_{23} + \left(\frac{B_{24}}{2}\right) \right] (225) = 52514 \times (225) = 11815650 \approx (3437.39)^2. \tag{70} \]
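The arithmetic in (70), and its bearing on the claim that the estimate (69) agrees with 2CR, can be checked with a few lines of Python. The numbers C, R, 225 and 52514 are those quoted in the text; the variable names are ours.

```python
import math

C = 21600           # circumference in minutes of arc (from the text)
R = 3438            # radius in minutes used by Bhaskara (from the text)
RSINE_SUM = 52514   # B1 + ... + B23 + B24/2, the value quoted in (70)
STRIP = 225         # altitude of each strip, in minutes

half_sum_area = RSINE_SUM * STRIP                  # left-hand side of (70)
implied_radius = math.sqrt(half_sum_area)          # to be compared with R
surface_estimate = 2 * (C / R) * half_sum_area     # equation (69)
surface_claimed = 2 * C * R                        # Bhaskara's 2CR

print("sum x 225            =", half_sum_area)
print("square root          =", round(implied_radius, 2))
print("estimate / (2 C R)   =", surface_estimate / surface_claimed)
```

The ratio printed in the last line differs from 1 by less than one part in a thousand, so the tabulated Rsines reproduce 2CR to about the accuracy of the table itself.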


[...] mentions that we may [...] more arc-bits, instead of the usual [...] the approach [...] number, n, of equal arc-bits. If $\Delta$ is the Rsine of each arc-bit, the surface area is estimated to be

\[ S = 2\left(\frac{C}{R}\right) (B_1 + B_2 + \cdots + B_n)(\Delta). \tag{72} \]

Then it is shown that in the limit of large n,

\[ (B_1 + B_2 + \cdots + B_n)(\Delta) \approx R^2, \tag{73} \]

which leads to the result 2CR for the surface area.⁴⁹
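The limit (73) can be illustrated numerically. The sketch below uses modern sine values rather than any tabulated Rsines (an assumption on our part), divides a quadrant into n equal arc-bits, and prints the ratio of the sum in (73) to R². The function name is ours.

```python
import math

def rsine_strip_sum(R, n):
    """(B_1 + ... + B_n) * Delta for n equal arc-bits spanning a quadrant."""
    arc_bit = (math.pi / 2.0) / n            # angular size of one arc-bit
    delta = R * math.sin(arc_bit)            # Rsine of one arc-bit, as in (72)
    total = sum(R * math.sin(j * arc_bit) for j in range(1, n + 1))
    return total * delta

if __name__ == "__main__":
    R = 3438.0
    for n in (24, 240, 2400, 24000):
        ratio = rsine_strip_sum(R, n) / (R * R)
        print(f"n = {n:6d}: sum / R^2 = {ratio:.6f}")
```

The ratio approaches 1 roughly in proportion to 1/n, which is the sense in which (73) holds "in the limit of large n".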
As regards the volume of a sphere, Bhaskara's justification is much simpler:⁵⁰


The surface area of a sphere multiplied by its diameter and divided by six is its volume. Here is the justification. As many pyramids as there are units in the surface area with bases of unit side and altitude equal to the semi-diameter should be imagined on the surface of the sphere. The apices of the pyramids meet at the centre of the sphere. Then the volume of the sphere is the sum of the volumes of the pyramids and thus our result is justified. The view that the volume is the product of the area times its own root, is perhaps an alien view (paramata) that has been presented by Caturavedacarya [Prthudakasvamin].


We may note that it is the Aryabhatiya rule which is referred to as paramata in the above passage. Bhaskara's derivation of the volume of a sphere is similar to that of the area of a circle by approximating it as the sum of the areas of a large number of triangles with their vertices at the centre, which is actually the proof given in Yuktibhasa. In the case of the volume of a sphere, Yuktibhasa, however, gives the more "standard" derivation, where the sphere is divided into a large number of slices and the volume is found as the sum of the volumes of the slices, which ultimately involves estimating the sum of squares of natural numbers (varga-sankalita), $1^2 + 2^2 + 3^2 + \cdots + n^2$, for large n.⁵¹


49 Ganita-yukti-bhasa, cited above, Section 7.18, pp. 140-42, 261-63, 465-67. In modern terminology, this amounts to the evaluation of the integral $\int_0^{\pi/2} R\sin\theta \, R\, d\theta = R^2$.
50 Siddhantasiromani, cited above, Vasana on Goladhyaya 2.61, p. 364.
51 Ganita-yukti-bhasa, cited above, Section 7.19, pp. 142-45, 263-66, 468-70.

PART II: WORK OF THE KERALA SCHOOL


Madhava to Sankara Variyar (c. 1350-1550 CE) 


8. Kerala School of Astronomy 


The Kerala School of Astronomy in the medieval period, pioneered by Madhava (c. 1340-1420) of Sangamagrama, extended well into the 19th century as exemplified in the work of Sankaravarman (c. 1830), Raja of Kadattanadu. Only a couple of astronomical works of Madhava (Venvaroha and Sphutacandrapti) seem to be extant now. Most of his celebrated mathematical discoveries, such as the infinite series for $\pi$ and the sine and cosine functions, are available only in the form of citations in later works.

Madhava's disciple Paramesvara (c. 1380-1460) of Vatasseri is reputed to have carried out detailed observations for over 50 years. A large number of original works and commentaries written by him have been published. However, his most important work on mathematics, the commentary Vivarana on Lilavati of Bhaskara II, is yet to be published.

Nilakantha Somayaji (c. 1444-1550) of Kundagrama, disciple of Paramesvara's son Damodara (c. 1410-1520), is the most celebrated member of the Kerala School after Madhava. Nilakantha has cited several important results of Madhava in his various works, the most prominent of them being Tantrasangraha (c. 1500) and Aryabhatiya-bhasya. In the latter work, while commenting on Ganitapada of Aryabhatiya, Nilakantha has also dealt extensively with many important mathematical issues.

However, the most detailed exposition of the work of the Kerala School, start- 
ing from Madhava, and including the seminal contributions of Paramesvara, 
Damodara and Nilakantha, is to be found in the famous Malayalam work Yuk- 
tibhasa (c. 1530) of Jyesthadeva (c. 1500-1610). Jyesthadeva was also a dis- 
ciple of Damodara but junior to Nilakantha. The direct lineage from Madhava 
continued at least till Acyuta Pisarati (c. 1550—1621), a disciple of Jyesthadeva, 
who wrote many important works and a couple of commentaries in Malayalam 
also. 

At the very beginning of Yuktibhasa, Jyesthadeva states that he intends to present the rationale of the mathematical and astronomical results and procedures which are to be found in Tantrasangraha of Nilakantha. Yuktibhasa, comprising 15 chapters, is naturally divided into two parts, Mathematics and Astronomy. Topics in astronomy proper, so to say, are taken up for consideration only from the eighth chapter onwards, starting with a discussion on mean and true planets.

The first seven chapters of Yuktibhasa are in fact in the nature of an independent treatise on mathematics and deal with various topics which are of relevance to astronomy. It is here that one finds detailed demonstrations of the results of Madhava such as the infinite series for $\pi$, the arc-tangent, sine and the cosine functions, the estimation of correction terms and their use in the generation of faster convergent series. Demonstrations are also provided for the classical results of Aryabhata (c. 499) on kuttakara (linear indeterminate equations), of Brahmagupta (c. 628) on the diagonals and the area of a cyclic quadrilateral, and of Bhaskara II (c. 1150) on the surface area and volume of a sphere. Many of these rationales have also been presented mostly in the form of Sanskrit verses by Sankara Variyar (c. 1500-1560) of Trkkutaveli in his commentaries Kriyakramakari (c. 1535) on Lilavati of Bhaskara II and Yukti-dipika on Tantrasangraha of Nilakantha. In fact, Sankara Variyar ends his commentary on the first chapter of Tantrasangraha with the acknowledgement:⁵²


[Sanskrit verse; illegible in this copy.]

Whatever has been the meaning as expounded by the noble dvija of Parakroda [Jyesthadeva] the same has now been stated by me for the first chapter of Tantrasangraha.


In the following sections we shall present an overview of the contribution of the Kerala School to the development of calculus (during the period 1350-1500), following essentially the exposition given in Yuktibhasa. In order to indicate some of the concepts and methods developed by the Kerala astronomers, we first take up the issue of irrationality of $\pi$ and the summation of infinite geometric series as discussed by Nilakantha Somayaji in his Aryabhatiya-bhasya. We then consider the derivation of binomial series expansion and the estimation of the sum of integral powers of integers, $1^k + 2^k + \cdots + n^k$ for large n, as presented in Yuktibhasa. These results constitute the basis for the derivation of the infinite series for $\pi$ due to Madhava. We shall outline this as also the very interesting work of Madhava on the estimation of the end-correction terms and the transformation of the $\pi$-series to achieve faster convergence. Finally we shall summarize the derivation of the infinite series for Rsine and Rcosine due to Madhava.

In the final section, we shall deal with another topic which has a bearing on calculus, but is not dealt with in Yuktibhasa, namely the evaluation of the instantaneous velocity of a planet. Here, we shall present the result of Damodara, as cited by Nilakantha, on the instantaneous velocity of a planet which involves the derivative of the arc-sine function. There are indeed many works and commentaries by later astronomers of the Kerala School, whose mathematical contributions are yet to be studied in detail. We shall here cite only one result due to Acyuta Pisarati (c. 1550-1621), a disciple of Jyesthadeva, on the instantaneous velocity of a planet, which involves the evaluation of the derivative of the ratio of two functions.


52 Tantrasangraha of Nilakantha Somayaji, Ed. with Yukti-dipika of Sankara Variyar by K. V. Sarma, Hoshiarpur 1977, p. 77. The same acknowledgement appears at the end of the subsequent chapters also.


9. Nilakantha's discussion of irrationality of $\pi$


In the context of discussing the procedure for finding the approximate square root of a non-square number, by multiplying it by a large square number (the method given in Trisatika of Sridhara referred to earlier in Section 3.3), Nilakantha observes in his Aryabhatiya-bhasya:⁵³

[Sanskrit text; illegible in this copy.]

Even if we were to proceed this way, the square root obtained will only be approximate. The idea [that is being conveyed] is, that it is actually not possible to exactly de-limit (paricchedah) the square root of a non-square number. Precisely for this reason, multiplication by a large square was stated (recommended) in order to get as much accuracy as desired.
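The procedure referred to in the passage above can be sketched in a few lines of Python. This is a modern paraphrase under our own naming, not the wording of the Trisatika rule: the number N is multiplied by a large square m², the integer square root of the product is taken, and the result is divided by m.

```python
import math
from fractions import Fraction

def approx_sqrt(n, m):
    """Approximate sqrt(n) as isqrt(n * m^2) / m for a large integer m."""
    return Fraction(math.isqrt(n * m * m), m)

if __name__ == "__main__":
    for k in (1, 3, 6, 12):
        m = 10 ** k                      # the "large square" is m * m
        q = approx_sqrt(2, m)
        shortfall = 2 - q * q            # never zero for a non-square number
        print(f"m = 10^{k:<2d}: sqrt(2) ~ {float(q):.12f}, "
              f"2 - q^2 = {float(shortfall):.3e}")
```

However large the multiplier, the square of the result falls short of the number by a nonzero amount, which is the point of Nilakantha's remark; the shortfall merely shrinks as m grows.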


Regarding the choice of the large number that must be made, it is mentioned that one may choose any number, as large a number as possible, that gives the [...] large number you want up to your satisfaction [...]

As will be stated [by the author himself]: 'this is [only] an approximate measure of the circumference of the circle whose diameter is twenty-thousand.' In finding the circumference from the diameter, a series of inferences is involved. The approximate nature of this also stems from the fact that it involves finding square roots. All this will be explained later at the appropriate context.


Addressing the issue later in his commentary, as promised earlier, while discussing the value of $\pi$, Nilakantha observes:⁵⁶

[Sanskrit text; illegible in this copy.]


The relation between the circumference and the diameter has been presented. ... Approximate: This value (62,832) has been stated as only an approximation to the circumference of a circle having a diameter of 20,000. "Why then has an approximate value been mentioned here instead of the actual value?" It is explained [as follows]. Because it (the exact value) cannot be expressed. Why?


Explaining as to why the exact value cannot be presented, Nilakantha continues:⁵⁷

[Sanskrit text; illegible in this copy.]


Given a certain unit of measurement (mana) in terms of which the diameter (vyasa) specified [is just an integer and] has no [fractional] part (niravayava), the same measure when employed to specify the circumference (paridhi) will certainly have a [fractional] part (savayava) [and cannot be just an integer]. Again if in terms of certain [other] measure the circumference has no [fractional] part, then employing the same measure the diameter will certainly have a [fractional] part [and cannot be an integer]. Thus when both [the diameter and the circumference] are measured by the same unit, they cannot both be specified [as integers] without [fractional] parts. Even if you go a long way (i.e., keep on reducing the measure of the unit employed), the fractional part [in specifying one of them] will only become very small. A situation in which there will be no [fractional] part (i.e., both the diameter and circumference can be specified in terms of integers) is impossible, and this is the import [of the expression asanna].


56 Ibid., comm. on Ganitapada 10, p. 41.
57 Ibid., pp. 41-42.


Evidently, what Nilakantha is trying to explain here is the incommensurability 
of the circumference and the diameter of a circle. Particularly, the last line of 
the above quote—where Nilakantha clearly mentions that, however small you 
may choose your unit of measurement to be, the two quantities will never become 
commensurate—is noteworthy. 


10. Nilakantha's discussion of the sum of an infinite geometric series 


In his Aryabhatiya-bhasya, while deriving an interesting approximation for the arc of a circle in terms of the jya (Rsine) and the sara (Rversine), Nilakantha presents a detailed demonstration of how to sum an infinite geometric series. The specific geometric series that arises in this context is:

\[ \frac{1}{4} + \left(\frac{1}{4}\right)^2 + \cdots + \left(\frac{1}{4}\right)^n + \cdots = \frac{1}{3}. \]

We shall now present an outline of Nilakantha's argument that gives an idea of how the notion of limit was understood in the Indian mathematical tradition.


10.1. [...] approximate formula for the arc in terms of jya and sara


[...] arc whose length (assumed to be small) is to be determined [...]. In the Indian mathematical literature, [...]


[...] can be seen from the figure, this termi[nology] [...] geometrical objects look like a bow, a string [...] and s, the expression for the arc [...]

FIGURE 7. Arc-length in terms of jya and sara.


Nilakantha's proof of the above equation has been discussed in detail by Sarasvati Amma.⁵⁸ It may also be mentioned that the above approximation actually does not form a part of the text Aryabhatiya; nevertheless, it is introduced by Nilakantha while commenting upon a verse in Aryabhatiya that gives the arc in terms of the chords in a circle.⁵⁹ The verse that succinctly presents the above equation (74) goes as follows:


[Sanskrit verse; illegible in this copy.]


The arc is nearly (prayah) equal to the square root of the sum of the square of the sara added to one-third of it, and the square of the jya.
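The rule stated in the verse can be tried out numerically. In the sketch below the arc subtends an angle phi at the centre, the jya is taken as R sin(phi) and the sara as R(1 - cos(phi)); this reading of the quantities in terms of modern trigonometric functions is our assumption, as are the sample values.

```python
import math

def arc_from_jya_sara(jya, sara):
    """Arc length by the verse: sqrt((1 + 1/3) * sara^2 + jya^2)."""
    return math.sqrt((1.0 + 1.0 / 3.0) * sara ** 2 + jya ** 2)

def chord_length(jya, sara):
    """Plain chord sqrt(sara^2 + jya^2), for comparison."""
    return math.sqrt(sara ** 2 + jya ** 2)

if __name__ == "__main__":
    R = 3438.0
    for phi in (0.5, 0.2, 0.1, 0.05):        # arc angle in radians
        jya = R * math.sin(phi)
        sara = R * (1.0 - math.cos(phi))
        exact = R * phi
        err_rule = abs(arc_from_jya_sara(jya, sara) - exact) / exact
        err_chord = abs(chord_length(jya, sara) - exact) / exact
        print(f"phi = {phi:4.2f}: rule error {err_rule:.2e}, "
              f"chord error {err_chord:.2e}")
```

The extra one-third of the squared sara removes the leading error of the chord, so the relative error of the rule falls off as the fourth power of the arc rather than the second.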


The proof of (74) given by Nilakantha involves:

(1) Repeated halving of the arc-bit, capa c, to get $c_1, \ldots, c_i, \ldots$

(2) Finding the corresponding semi-chords, jya ($j_i$), and the Rversines, sara ($s_i$).

[...]




then it is seen that this difference decreases as the size of the capa decreases. 
Having made this observation, Nilakantha proceeds with the argument that


• Generating successive values of the $j_i$-s and $s_i$-s is an 'unending' process (na kvacidapi paryavasyati), as one can keep on dividing the capa into half ad infinitum (anantyat vibhagasya).


• It would therefore be appropriate to proceed up to a stage where the difference $\delta_i$ becomes negligible (sunyapraya) and make an 'intelligent approximation', to obtain the value of the difference between c and j approximately.


The original passage in Aryabhatiya-bhasya which presents the above argument reads as follows:⁶¹


10.2. Nilakantha's summation of the infinite geometric series 


The question that Nilakantha poses as he commences his detailed discussion on the sum of geometric series is very important and arises quite naturally whenever one encounters the sum of an infinite series:⁶²


[Sanskrit text; illegible in this copy.]


[...] you know that [the sum of the series] increases only up to that [limiting value ...] that it certainly increases up to that [limiting value]?


[...] answer the above question, Nilakantha first states the general [...]

[...] is assumed to be greater than 1. He further notes that this result is best demonstrated by considering a particular case, say r = 4. In his own words:


[Sanskrit text; illegible in this copy.]


It is being explained. Thus, in an infinite (ananta) geometrical series (tulya-ccheda-parabhaga-parampara) the sum of all the infinite number of terms considered will always be equal to the value obtained by dividing by a factor which is one less than the common factor of the series. That this is so will be demonstrated by first considering the series obtained with one-fourth (caturamsa-parampara).


What is intended to be demonstrated is

\[ a\left[ \frac{1}{4} + \left(\frac{1}{4}\right)^2 + \cdots + \left(\frac{1}{4}\right)^n + \cdots \right] = \frac{a}{3}. \tag{75} \]

Besides the multiplying factor a, it is noted that one-fourth and one-third are the only terms appearing in the above equation. Nilakantha first defines these numbers in terms of one-twelfth of the multiplier a, referred to by the word rasi. For the sake of simplicity we take the rasi to be unity.


[Equation (76); illegible in this copy.]

Nilakantha then goes on to present the following crucial argument to derive the sum of the infinite geometric series: As we sum more terms, the difference between $\frac{1}{3}$ and the sum of powers of $\frac{1}{4}$ (as given by the right hand side of (76)) becomes extremely small, but never zero. Only when we take all the terms of the infinite series together do we obtain the equality

\[ \frac{1}{4} + \left(\frac{1}{4}\right)^2 + \cdots + \left(\frac{1}{4}\right)^n + \cdots = \frac{1}{3}. \tag{77} \]

A brief extract from the text presenting the above argument is given below:⁶⁴


[Sanskrit text; illegible in this copy.]


Three times one-twelfth of a rasi is one-fourth (caturamsa) [of that rasi]. Four times that is one-third (tryamsa). [Considering] four times that [one-twelfth of the rasi] which is one-third, three by fourth of that falls short by one-fourth [of one-third of the rasi]. Three-fourths of that [i.e., of 1/12 of the rasi] which is one-fourth of that (tryamsa), again falls short [of the same] by one-fourth of one-fourth [of one-third of the rasi] ...


... Since the result to be demonstrated or the process to be carried out is never ending (anantyat) and the difference, though very small (atisuksmatvat), [still exists, the sum of the series] cannot be simply taken to be one-third. It seems that the process is incomplete since always something remains because of its never[-ending]
nature. In fact, since in all the problems involving [infinite] series, by [...]ing in all the terms and placing them together, the process would [in principle] become complete, here [...] in the mathematics [...] repeated multiplication [...]

11. Derivation of binomial series expansion


Yuktibhasa presents a very interesting derivation of the binomial series for $(1 + x)^{-1}$ by making iterative substitutions in an algebraic identity. The method given in the text may be summarized as follows.⁶⁵

Consider the product $a\left(\frac{c}{b}\right)$, where some quantity a is multiplied by the multiplier c and divided by the divisor b. Here, a is called gunya, c the gunaka and b the hara, which are all assumed to be positive. Now the above product can be rewritten as:

\[ a\left(\frac{c}{b}\right) = a - a\,\frac{(b-c)}{b}. \tag{78} \]


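In the expression $a\frac{(b-c)}{b}$ in (78) above, if we want to replace the division by b (the divisor) by division by c (the multiplier), then we have to make a subtractive correction (called sodhya-phala), which amounts to the following equation:

\[ a\,\frac{(b-c)}{b} = a\,\frac{(b-c)}{c} - \left( a\,\frac{(b-c)}{c} \times \frac{(b-c)}{b} \right). \tag{79} \]

Now, in the second term (inside the parentheses) in (79), which is what we referred to as sodhya-phala (literally, a quantity to be subtracted), if we again replace the division by b by division by c, then we have to employ the relation (79) once again to get another subtractive term:

\[ a\,\frac{c}{b} = a - \left[ a\,\frac{(b-c)}{c} - \left( a\,\frac{(b-c)}{c} \times \frac{(b-c)}{b} \right) \right] \]
\[ = a - \left[ a\,\frac{(b-c)}{c} - \left( a\,\frac{(b-c)}{c} \times \frac{(b-c)}{c} - a\,\frac{(b-c)}{c} \times \frac{(b-c)}{c} \times \frac{(b-c)}{b} \right) \right] \]
\[ = a - a\,\frac{(b-c)}{c} + \left[ a\left(\frac{(b-c)}{c}\right)^2 - \left( a\left(\frac{(b-c)}{c}\right)^2 \times \frac{(b-c)}{b} \right) \right]. \tag{80} \]

Here, the quantity $a\left[\frac{(b-c)}{c}\right]^2$ is called dvitiya-[...] one subtracted from that is dvitiya-sodhya-p[hala] [...] operations, the m-th sodhya-phala subtrac[...]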


Since the successive sodhya-phalas are subtracted from their immediately preceding term, we will end up with a series in which all the odd terms (leaving out the gunya, a) are negative and the even ones positive. Thus, after taking m sodhya-phalas we get

\[ a\,\frac{c}{b} = a - a\,\frac{(b-c)}{c} + a\left[\frac{(b-c)}{c}\right]^2 - \cdots + (-1)^{m-1} a\left[\frac{(b-c)}{c}\right]^{m-1} + (-1)^{m} a\left[\frac{(b-c)}{c}\right]^{m-1} \frac{(b-c)}{b}. \tag{81} \]

Regarding the question of termination of the process, both the texts Yuktibhasa and Kriyakramakari clearly mention that logically there is no end to the process of generating sodhya-phalas. We may thus write our result as:

\[ a\,\frac{c}{b} = a - a\,\frac{(b-c)}{c} + a\left[\frac{(b-c)}{c}\right]^2 - \cdots + (-1)^{m-1} a\left[\frac{(b-c)}{c}\right]^{m-1} + (-1)^{m} a\left[\frac{(b-c)}{c}\right]^{m} + \cdots \tag{82} \]
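The finite identity (81) and the role of the ratio (b - c)/c can be checked with exact fractions. The following sketch is our own paraphrase in Python: it returns the first m terms of the series together with the final correction term of (81), so that their sum can be compared with a·c/b; the sample values of a, b and c are hypothetical.

```python
from fractions import Fraction

def sodhya_expansion(a, b, c, m):
    """Split a*c/b as in (81): m series terms plus the exact last term."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    x = (b - c) / c                          # ratio multiplying each new phala
    series = sum(a * (-x) ** k for k in range(m))
    remainder = (-1) ** m * a * x ** (m - 1) * (b - c) / b
    return series, remainder

if __name__ == "__main__":
    a, b, c = 7, 5, 4                        # here (b - c) is smaller than c
    exact = Fraction(a * c, b)
    for m in (1, 2, 3, 5, 10):
        series, remainder = sodhya_expansion(a, b, c, m)
        assert series + remainder == exact   # (81) holds exactly for every m
        print(f"m = {m:2d}: truncated series = {float(series):.8f}, "
              f"neglected term = {float(remainder):+.2e}")
```

When (b - c) is smaller than c the neglected term shrinks with each step and the truncated series converges to a·c/b; when it is larger, (81) still holds exactly but the neglected term grows, so the series cannot be cut off.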


It is also noted that the process may be terminated after having obtained the desired accuracy by neglecting the subsequent phalas as their magnitudes become smaller and smaller. In fact, Kriyakramakari explicitly mentions the condition under which the succeeding phalas will become smaller and smaller:


[Sanskrit text; illegible in this copy.]


[...] terms that may be obtained further (pasca[...]) [...] phalas will become smaller and smaller [...] is smaller than gunaka [...]

12. Estimation of sums $1^k + 2^k + \cdots + n^k$ for large n


As mentioned in section 4.1, Aryabhata has given the explicit formulae for the summation of squares and cubes of integers. The word employed in the Indian mathematical literature for summation is sankalita. The formulae given by Aryabhata for the sankalitas are as follows:

\[ S_n^{(1)} = 1 + 2 + \cdots + n = \frac{n(n+1)}{2}, \]
\[ S_n^{(2)} = 1^2 + 2^2 + \cdots + n^2 = \frac{n(n+1)(2n+1)}{6}, \]
\[ S_n^{(3)} = 1^3 + 2^3 + \cdots + n^3 = \left[\frac{n(n+1)}{2}\right]^2. \tag{83} \]

From these, it is easy to estimate these sums when n is large. Yuktibhasa gives a general method of estimating the sama-ghata-sankalita

\[ S_n^{(k)} = 1^k + 2^k + \cdots + n^k, \tag{84} \]

when n is large. The text presents a general method of estimation, which does not make use of the actual value of the sum. In fact, the same argument is repeated even for k = 1, 2, 3, although the result of summation is well known in these cases.
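The closed forms (83) and the large-n behaviour of (84) can be verified directly. The sketch below checks the three formulae exactly and compares the sums with n^(k+1)/(k+1), which is the standard leading-order estimate for such sums; we use it here as a known mathematical fact and not as a quotation from the text. The function names are ours.

```python
from fractions import Fraction

def power_sum(n, k):
    """The sama-ghata-sankalita 1^k + 2^k + ... + n^k of (84)."""
    return sum(j ** k for j in range(1, n + 1))

def leading_estimate(n, k):
    """Leading-order estimate n^(k+1) / (k+1) for large n."""
    return Fraction(n ** (k + 1), k + 1)

if __name__ == "__main__":
    n = 50
    assert power_sum(n, 1) == n * (n + 1) // 2                  # (83), k = 1
    assert power_sum(n, 2) == n * (n + 1) * (2 * n + 1) // 6    # (83), k = 2
    assert power_sum(n, 3) == (n * (n + 1) // 2) ** 2           # (83), k = 3
    for k in (1, 2, 3, 4):
        for n in (10, 100, 1000):
            ratio = power_sum(n, k) / leading_estimate(n, k)
            print(f"k = {k}, n = {n:4d}: sum / estimate = {float(ratio):.5f}")
```

The ratio tends to 1 as n grows, with a relative excess of roughly (k + 1)/(2n).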


12.1. The sum of natural numbers (Mula-sankalita)


Yuktibhasa takes up the discussion on sankalitas in the context of evaluating the circumference of a circle which is conceived to be inscribed in a square. It is half the side of this square that is being referred to by the word bhuja in both the citations as well as explanations offered below. Half of the side of [...] (equal to the radius) is divided into n equal bits, known as bhuja-kh[andas] [...] these bhuja-khandas $\left(\frac{r}{n}\right), 2\left(\frac{r}{n}\right), \ldots$ whose powers are summed.

To start with, Yuktibhasa discusses [...] called Mula-sankalita. We now cite the following [...] Yuktibhasa:


[...] equal to the radius. The term before that will be one segment (khanda) less. The
next one will be two segments less. Here, if all the terms (bhujas) had been equal
to the radius, the result of the summation would be obtained by multiplying the
radius by the number of bhujas. However, here, only one bhuja is equal to the
radius. And, from that bhuja, those associated with the smaller hypotenuses are
less by one segment each, in order. Now, suppose the radius to be of the same
number of units as the number of segments to which it has been divided, in order
to facilitate remembering (their number). Then, the number associated with the
penultimate bhuja will be less by one (from the number of units in the radius);
the number of the next one, will be less by two from the number of units in the
radius. This reduction (in the number of segments) will increase by one (at each
step). The last reduction will practically be equal to the measure of the radius, for
it will be less only by one segment. In other words, when the reductions are all
added, the sum thereof will practically (prayena) be equal to the summation of
the series from 1 to the number of units in the radius; it will be less only by one
radius length. Hence, the summation will be equal to the product of the number
of units in the radius with the number of segments plus one, and divided by 2.
The summation of all the bhujas of the different hypotenuses is called
bhuja-sankalita.
Now, the smaller the segments, the more accurate (sūksma) will be the result. 
Hence, do the summation also by taking each segment as small as an atom (anu). 
Here, if it (namely, the bhuja or the radius) is divided into parardha (a very 
large number) parts, to the bhuja obtained by multiplying by parardha add one 
part in parardha and multiply by the radius and divide by 2, and then divide by 
parardha. For, the result will practically be the square of the radius divided by 
two. ... 




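The first summation, the bhuja-sankalita, may be written in the reverse order
from the final bhuja to the first bhuja as

$$S_n^{(1)} = \left(\frac{nr}{n}\right) + \left(\frac{(n-1)r}{n}\right) + \cdots + \left(\frac{r}{n}\right). \tag{85}$$

Now, conceive of the bhuja-khanda $r/n$ as being infinitesimal (anu) and at the same
time as of unit measure, so that the radius has the measure $n$, the number of terms. Then

$$S_n^{(1)} = n + (n-1) + \cdots + 1. \tag{86}$$

If each of the terms were of the measure of radius ($n$) then the sum would be
$n^2$. But only the first term is of the measure of the radius; the next is deficient
by one segment, the next by two segments, and so on, until the last term, which is
deficient by an amount equal to radius-minus-one segment. In other words,

$$S_n^{(1)} = n + [n-1] + [n-2] + \cdots + [n-(n-1)] = n \cdot n - [1 + 2 + \cdots + (n-1)]. \tag{87}$$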
When $n$ is very large, the quantity to be subtracted from $n^2$ is practically (prayena)
the same as $S_n^{(1)}$, thus leading to the estimate

$$S_n^{(1)} \approx n^2 - S_n^{(1)}, \tag{88}$$

or

$$S_n^{(1)} \approx \frac{n^2}{2}. \tag{89}$$


It is stated that the result is more accurate when the size of the segments is small
(or equivalently, the value of $n$ is large).

If instead of making the approximation as in (88), we proceed with (87) as it is,
we get $S_n^{(1)} = n^2 - (S_n^{(1)} - n)$, which leads to the well-known exact value of the
sum of the first $n$ natural numbers

$$S_n^{(1)} = \frac{n(n+1)}{2}. \tag{90}$$

With the convention that the bhuja-khanda $r/n$ is of unit measure, the above estimate (89)
is stated in the form that the bhuja-sankalita is half the square of the radius.
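
The relation between the exact sum (90) and the large-$n$ estimate (89) can be checked numerically. The following is only an illustrative sketch in Python; the function names are ours and are not part of the text. It adds the terms one by one and reports by what fraction the estimate "half the square" falls short of the true sum.

```python
def mula_sankalita(n: int) -> int:
    """Add 1 + 2 + ... + n term by term (no closed formula used)."""
    return sum(range(1, n + 1))


def half_square_estimate(n: int) -> float:
    """Large-n estimate (89): half the square of the number of terms."""
    return n * n / 2


def relative_shortfall(n: int) -> float:
    """Fraction by which the estimate (89) falls short of the true sum."""
    exact = mula_sankalita(n)
    return (exact - half_square_estimate(n)) / exact


if __name__ == "__main__":
    for n in (10, 1_000, 100_000):
        # The exact value (90) holds for every n.
        assert mula_sankalita(n) == n * (n + 1) // 2
        print(n, relative_shortfall(n))
```

The shortfall works out to $1/(n+1)$, so it shrinks as the segments are made smaller, in line with the remark that the result becomes more accurate for large $n$.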


12.2. Summation of squares (Varga-sankalita)


We now cite the following from the translation of Yuktibhasa:⁷⁰

Now is explained the summation of squares (varga-sankalita). Obviously, the
squares of the bhujas, which are summed up above, are the bhujas each multiplied
by itself. Here, if the bhujas which are all multipliers, had all been equal to
the radius, their sum, (sankalita derived above), multiplied by the radius would
have been the summation of their squares. Here, however, only one multiplier
happens to be equal to the radius, and that is the last one. The one before that will
have the number of segments one less than in the radius. (Hence) if that, (i.e., the
second one), is multiplied by the radius, it would mean that one multiplied by the
penultimate bhuja would have been the increase in the summation of the squares.
Then (the segment) next below is the third. That will be less than the radius by
two segments. If that is multiplied by the radius, it will mean that, the summation
of the squares will increase by the product of the bhuja by two (segments).
In this manner, the summation in which the multiplication is done by the radius
(instead of the bhujas) would be larger than the summation of squares by terms


⁶⁹ Sankara Variyar also emphasizes the same idea, in his discussion of the estimation of
[...] in his commentary Kriyakramakari on Lilavati (cited above (fn. 14), comm. on verse
199, p. 382):

[Sanskrit text]

Only when the segment is small (khandasyalpatve) the result obtained would be
accurate.

⁷⁰ Ganita-yukti-bhasa, cited above, Section 6.4, pp. 61-67, 192-97, 382-88.
which involve the successively smaller bhujas multiplied by successively higher
numbers. If (all these additions) are duly subtracted from the summation where
the radius is used as the multiplier, the summation of squares (varga-sankalita)
will result.

Now, the bhuja next to the east-west line is less than the radius by one (segment).
So if all the excesses are summed up and added, it would be the summation of the
basic summation (mula-sankalita-sankalita). Because, the sums of the summations
is verily the 'summation of summations' (sankalita-sankalita). There,
the last sum has (the summation of) all the bhujas. The penultimate sum is next
lower summation to the last. This penultimate sum is the summation of all the
bhujas except the last bhuja. Next to it is the third sum which is the sum of all
the bhujas except the last two. Thus, each sum of the bhujas commencing from
any bhuja which is taken to be the last one in the series, will be less by one bhuja
from the sum (of the bhujas) before that.

Thus, the longest bhuja is included only in one sum. But the bhuja next lower
than the last (bhuja) is included both in the last sum and also in the next lower
sum. The bhujas below that are included in the three, four etc. sums below it.
Hence, it would result that the successively smaller bhujas commencing from the
one next to the last, which have been multiplied by numbers commencing from 1
and added together, would be summation of summations (sankalita-sankalita).
Now, it has been stated earlier that the summation (sankalita) of (the segments
constituting) a bhuja which has been very minutely divided, will be equal to
half the square of the last bhuja. Hence, it follows that, in order to obtain the
summation (sankalita) of the bhujas ending in any particular bhuja, we will
have to square each of the bhujas and halve it. Thus, the summation of summations
(sankalita-sankalita) would be half the summation of the squares of all the
bhujas. In other words, half the summation of the squares is the summation of the
basic summation. So, when the summation is multiplied by the radius, it would
be one and a half times the summation of the squares. This fact can be expressed
by stating that this contains half more of the summation of squares. Therefore,
when the square of the radius divided by two is multiplied by the radius and one-third
subtracted from it, the remainder will be one-third of the whole. Thus it [...]
This may be written in the form

$$\begin{aligned}
n S_n^{(1)} - S_n^{(2)} = {} & (n-1) + (n-2) + (n-3) + \cdots + 1 \\
& + (n-2) + (n-3) + \cdots + 1 \\
& + (n-3) + \cdots + 1 \\
& + \cdots.
\end{aligned} \tag{93}$$

Thus,

$$n S_n^{(1)} - S_n^{(2)} = S_{n-1}^{(1)} + S_{n-2}^{(1)} + S_{n-3}^{(1)} + \cdots. \tag{94}$$


The right hand side of (94) is called the sankalita-sankalita (or sankalitaikya),
the repeated sum of the sums $S_i^{(1)}$ (here taken in the order $i = n-1, n-2, \ldots, 1$).
These are defined also by Sankara Variyar in Kriyakramakari as follows:⁷¹

[Sanskrit text]

The sum of the summations is called sankalita-sankalita. Of them, the last
sankalita is the sum of all the bhujas. The penultimate sankalita is the sum of
all the bhujas other than the last one. The sankalita of the one preceding the
penultimate is the sum of the bhujas ending with that. Thus, all the preceding
sankalitas will fall short by a bhuja from the succeeding sankalita.


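For large $n$, we have already estimated in (89) that $S_n^{(1)} \approx n^2/2$. Thus, for large $n$,

$$n S_n^{(1)} - S_n^{(2)} \approx \frac{(n-1)^2}{2} + \frac{(n-2)^2}{2} + \frac{(n-3)^2}{2} + \cdots. \tag{95}$$

Thus, the right hand side of (94), the excess of $n S_n^{(1)}$ over $S_n^{(2)}$, is essentially
$S_n^{(2)}/2$ for large $n$, so that

$$n S_n^{(1)} - S_n^{(2)} \approx \frac{S_n^{(2)}}{2}. \tag{96}$$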
Again, using the earlier estimate (89) for $S_n^{(1)}$, we obtain the result

$$S_n^{(2)} \approx \frac{n^3}{3}. \tag{97}$$

Thus bhuja-varga-sankalita is one-third the cube of the radius.
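
As a numerical illustration of this argument, the sketch below (Python; the helper names are hypothetical and chosen only for this example) checks that the excess of $n$ times the sum of integers over the sum of squares equals the repeated sum on the right hand side of (94), and that the sum of squares approaches one-third of the cube for large $n$.

```python
def power_sum(n: int, k: int) -> int:
    """S_n^(k) = 1^k + 2^k + ... + n^k, added term by term."""
    return sum(i**k for i in range(1, n + 1))


def excess_over_squares(n: int) -> int:
    """Left side of (94): n * S_n^(1) - S_n^(2)."""
    return n * power_sum(n, 1) - power_sum(n, 2)


def sum_of_lower_sums(n: int) -> int:
    """Right side of (94): S_{n-1}^(1) + S_{n-2}^(1) + ... + S_1^(1)."""
    return sum(power_sum(m, 1) for m in range(1, n))


def ratio_to_third_of_cube(n: int) -> float:
    """S_n^(2) divided by the estimate n^3 / 3 of (97)."""
    return power_sum(n, 2) / (n**3 / 3)


if __name__ == "__main__":
    for n in (5, 50, 500):
        assert excess_over_squares(n) == sum_of_lower_sums(n)
        print(n, ratio_to_third_of_cube(n))
```

The identity holds exactly for every $n$; only the final step, replacing each lower sum by half a square, is an approximation.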


12.3. Sama-ghata-sankalita


We now cite the following from the translation of Yuktibhasa:⁷²


Now, the square of the square (of a number) is multiplied by itself, it is called
sama-panca-ghata (number multiplied by itself five times). The successive
higher order summations are called sama-pancadi-ghata-sankalita (and will
be the summations of powers of five and above). Among them if the summation
(sankalita) of powers of some order is multiplied by the radius, then the product
is the summation of summations (sankalita-sankalita) of the (powers of
the) multiplicand (of the given order), together with the summation of powers
(sama-ghata-sankalita) of the next order. Hence, to derive the summation of
the successive higher powers: Multiply each summation by the radius. Divide it
by the next higher number and subtract the result from the summation got before.
The result will be the required summation to the higher order.



Thus, divide by two the square of the radius. If it is the cube of the radius, divide 
by three. If it is the radius raised to the power of four, divide by four. If it is 
(the radius) raised to the power of five, divide by five. In this manner, for powers 
rising one by one, divide by numbers increasing one by one. The results will be, 
in order, the summations of powers of numbers (sama-ghata-sankalita). Here,
the basic summation is obtained from the square, the summation of squares from 
the cube, the summation of cubes from the square of the square. In this manner, 
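if the numbers are multiplied by themselves a certain number of times (i.e., raised [...]

[...] numbers, (their) squares etc.

In the case of a general sama-ghata-sankalita (summation of equal powers) given by

$$S_n^{(k)} = n^k + (n-1)^k + \cdots + 1^k, \tag{98}$$

the procedure for estimating it for large $n$ is essentially the same as for the
varga-sankalita. The excess of $n S_n^{(k-1)}$ over $S_n^{(k)}$ is a repeated sum of the
lower order sankalitas $S_r^{(k-1)}$:

$$n S_n^{(k-1)} - S_n^{(k)} = S_{n-1}^{(k-1)} + S_{n-2}^{(k-1)} + S_{n-3}^{(k-1)} + \cdots. \tag{99}$$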


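If the lower order sankalita $S_n^{(k-1)}$ has already been estimated to be, say,

$$S_n^{(k-1)} \approx \frac{n^k}{k}, \tag{100}$$

then the above relation (99) leads to⁷³

$$n S_n^{(k-1)} - S_n^{(k)} \approx \frac{(n-1)^k}{k} + \frac{(n-2)^k}{k} + \frac{(n-3)^k}{k} + \cdots \approx \left(\frac{1}{k}\right) S_n^{(k)}. \tag{101}$$

Rewriting the above equation we have⁷⁴

$$S_n^{(k)} \approx n S_n^{(k-1)} - \left(\frac{1}{k}\right) S_n^{(k)}. \tag{102}$$
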

Using (100), we obtain the estimate

$$S_n^{(k)} \approx \frac{n^{k+1}}{k+1}. \tag{103}$$
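
The estimate $S_n^{(k)} \approx n^{k+1}/(k+1)$ that follows from (100) and (102), and the exact relation (99) on which it rests, can both be tested numerically. The Python sketch below is an illustration only; the function names are ours and do not come from the text.

```python
def power_sum(n: int, k: int) -> int:
    """S_n^(k) = 1^k + 2^k + ... + n^k, added term by term."""
    return sum(i**k for i in range(1, n + 1))


def identity_99_holds(n: int, k: int) -> bool:
    """Check n*S_n^(k-1) - S_n^(k) == S_{n-1}^(k-1) + ... + S_1^(k-1)."""
    left = n * power_sum(n, k - 1) - power_sum(n, k)
    right = sum(power_sum(m, k - 1) for m in range(1, n))
    return left == right


def ratio_to_estimate(n: int, k: int) -> float:
    """S_n^(k) divided by the large-n estimate n^(k+1) / (k+1)."""
    return power_sum(n, k) * (k + 1) / n ** (k + 1)


if __name__ == "__main__":
    for k in (1, 2, 3, 5):
        assert identity_99_holds(30, k)
        print(k, ratio_to_estimate(2000, k))
```

For a fixed power $k$ the ratio tends to 1 as $n$ grows; the leading deviation is about $(k+1)/(2n)$.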


derivation of the following relation, which is based on the i 


fa — xyx*-ldx = i y* 


ns 


744s Sankara Variyar states in his Kriz 
12.4. Repeated summations (Sankalita-sankalita)


After having estimated the sum of powers of natural numbers (sama-ghata-sankalita),
Yuktibhasa goes on to derive an estimate for the repeated summation (sankalita-sankalita
or sankalitaikya or varasankalita) of the natural numbers $1, 2, \ldots, n$.⁷⁵


Now, are explained the first, second and further summations: The first summation
(adya-sankalita) is the basic summation (mula-sankalita) itself. It has
already been stated (that this is) half the product of the square of the number of
terms (pada-vargardha). The second (dvitiya-sankalita) is the summation of
the basic summation (mula-sankalitaikya). It has been stated earlier that it is
equal to half the summation of squares. And that will be one-sixth of the cube of
the number of terms.
Now, the third summation: For this, take the second summation as the last term
(antya); subtract one from the number of terms, and calculate the summation
of summations as before. Treat this as the penultimate. Then subtract two from
the number of terms and calculate the summation of summations. That will be the
next lower term. In order to calculate the summation of summations of numbers in
the descending order, the sums of one-sixths of the cubes of numbers in descending
order would have to be calculated. That will be the summation of one-sixth
of the cubes. And that will be one-sixth of the summation of cubes. As has been
enunciated earlier, the summation of cubes is one-fourth the square of the square.
Hence, one-sixth of one-fourth the square of the square will be the summation
of one-sixth of the cubes. Hence, one-twenty-fourth of the square of the square
will be the summation of one-sixth of the cubes. Then, the fourth summation will
be, according to the above principle, the summation of one-twenty-fourths of the
square of squares. This will also be equal to one-twenty-fourth of one-fifth of
the fifth power. Hence, when the number of terms has been multiplied by itself
a certain number of times, (i.e., raised to a certain degree), and divided by the
product of one, two, three etc. up to that index number, the result will be the
summation up to that index number amongst the first, second etc. summations
(adya-dvitiyadi-sankalita).


The first summation (adya-sankalita) $V_n^{(1)}$ is just the mula-sankalita or the
sum of natural numbers, which has already been estimated in (89)
As was done earlier, this second summation can be estimated using the estimate
(89) for $S_n^{(1)}$:

$$V_n^{(2)} \approx \frac{n^2}{2} + \frac{(n-1)^2}{2} + \frac{(n-2)^2}{2} + \cdots. \tag{106}$$

Therefore

$$V_n^{(2)} \approx \left(\frac{1}{2}\right) S_n^{(2)}. \tag{107}$$

Using the earlier estimate (97) for $S_n^{(2)}$ we get an estimate for the dvitiya-sankalita

$$V_n^{(2)} \approx \frac{n^3}{6}. \tag{108}$$


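Now the next repeated summation can be found in the same way

$$\begin{aligned}
V_n^{(3)} &= V_n^{(2)} + V_{n-1}^{(2)} + V_{n-2}^{(2)} + \cdots \\
&\approx \frac{n^3}{6} + \frac{(n-1)^3}{6} + \frac{(n-2)^3}{6} + \cdots \\
&\approx \left(\frac{1}{6}\right) S_n^{(3)} \\
&\approx \frac{n^4}{24}.
\end{aligned} \tag{109}$$

It is noted that proceeding this way we can estimate the repeated summation
$V_n^{(k)}$ of order $k$, for large $n$, to be⁷⁶

$$V_n^{(k)} \approx \frac{n^{k+1}}{1 \cdot 2 \cdot 3 \cdots (k+1)}. \tag{110}$$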
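
The rule quoted above from Yuktibhasa, that the repeated summation of a given order is the corresponding power of the number of terms divided by the product of one, two, three, etc., can be illustrated by forming the repeated sums directly. The Python sketch below is ours (the names are hypothetical); it builds running sums of running sums and compares the result with $n^{k+1}/(k+1)!$. The comparison with a binomial coefficient is a standard modern identity, not something stated in the text.

```python
from itertools import accumulate
from math import comb, factorial


def repeated_summation(n: int, k: int) -> int:
    """V_n^(k): the k-fold repeated sum of the natural numbers 1..n."""
    terms = list(range(1, n + 1))          # the natural numbers themselves
    for _ in range(k):
        terms = list(accumulate(terms))    # replace each term by a running sum
    return terms[-1]


def ratio_to_estimate(n: int, k: int) -> float:
    """V_n^(k) divided by the large-n estimate n^(k+1) / (k+1)!."""
    return repeated_summation(n, k) * factorial(k + 1) / n ** (k + 1)


if __name__ == "__main__":
    for k in (1, 2, 3, 4):
        # Modern closed form: V_n^(k) equals the binomial coefficient C(n+k, k+1).
        assert repeated_summation(50, k) == comb(50 + k, k + 1)
        print(k, ratio_to_estimate(2000, k))
```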
13. Derivation of the Madhava series for π


The following accurate value of π (correct to 11 decimal places), given by Madhava,
has been cited by Nilakantha in his Aryabhatiya-bhasya and by Sankara Variyar
in his Kriyakramakari.⁷⁷


[Sanskrit text]


The π value given above is:

$$\pi \approx \frac{2827433388233}{9 \times 10^{11}} = 3.141592653592\ldots \tag{111}$$


The 13 digit number appearing in the numerator has been specified using the
bhuta-sankhya system, whereas the denominator is specified by word numerals.
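
The accuracy of the value in (111) is easy to confirm with exact rational arithmetic. The short Python sketch below is only an illustration; the constant and function names are ours.

```python
from fractions import Fraction
import math

MADHAVA_NUMERATOR = 2827433388233      # numerator quoted in (111)
MADHAVA_DENOMINATOR = 9 * 10**11       # denominator quoted in (111)


def madhava_pi() -> Fraction:
    """Return the ratio of (111) as an exact fraction."""
    return Fraction(MADHAVA_NUMERATOR, MADHAVA_DENOMINATOR)


if __name__ == "__main__":
    value = madhava_pi()
    print(f"{float(value):.13f}")
    # Difference from the floating-point value of pi.
    print(f"{abs(float(value) - math.pi):.2e}")
```

The difference from π is a few parts in $10^{12}$, which is what "correct to 11 decimal places" amounts to.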


13.1. Infinite series for π


The infinite series for π attributed to Madhava is cited by Sankara Variyar in his
commentaries Kriyakramakari and Yukti-dipika. Madhava's verse quoted runs
as follows:⁷⁹


[Sanskrit text]

The words paridhi and vyasa in the above equation refer to the circumference
and diameter respectively. Hence the equation may be rewritten as

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots. \tag{113}$$


FIGURE 8. Geometrical construction used [...] infinite series for π.


We shall now present the derivation of the [...] Yuktibhasa of Jyesthadeva and
Kriyakramakari [...]. For this purpose, let us consider [...]
[...] are the bhujas and the $OP_i$'s are the karnas denoted by $k_i$. The points of
intersection of these karnas and the circle are marked as $A_i$'s.

The bhujas $P_0P_i$, the karnas $k_i$ and the east-west line $OP_0$ form right-angled
triangles whose hypotenuses are given by

$$k_i^2 = r^2 + \left(\frac{ir}{n}\right)^2, \tag{114}$$

where $r$ is the radius of the circle.

The feet of perpendiculars from the points $A_{i-1}$ and $P_{i-1}$ along the $i$-th karna
are denoted by $B_i$ and $C_i$. The triangles $OP_{i-1}C_i$ and $OA_{i-1}B_i$ are similar.
Hence,

$$\frac{A_{i-1}B_i}{OA_{i-1}} = \frac{P_{i-1}C_i}{OP_{i-1}}. \tag{115}$$

Similarly triangles $P_{i-1}C_iP_i$ and $P_0OP_i$ are similar. Hence,

$$\frac{P_{i-1}C_i}{P_{i-1}P_i} = \frac{OP_0}{OP_i}. \tag{116}$$


From these two relations we have,

$$\begin{aligned}
A_{i-1}B_i &= \frac{OA_{i-1} \cdot OP_0 \cdot P_{i-1}P_i}{OP_{i-1} \cdot OP_i} \\
&= P_{i-1}P_i \times \frac{OA_{i-1}}{OP_{i-1}} \times \frac{OP_0}{OP_i} \\
&= \frac{r}{n} \times \frac{r}{k_{i-1}} \times \frac{r}{k_i} \\
&= \left(\frac{r}{n}\right)\left(\frac{r^2}{k_{i-1}k_i}\right).
\end{aligned} \tag{117}$$


[...] $A_{i-1}B_i$ can be taken as the arc-bits [...]
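Though this is the expression that actually needs to be evaluated, the text mentions
that there may not be much difference in approximating it by either of the
following expressions:

$$\frac{C}{8} \approx \sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{k_{i-1}^2}\right) \tag{119}$$

or

$$\frac{C}{8} \approx \sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{k_i^2}\right). \tag{120}$$

It can be easily seen that

$$\sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{k_i^2}\right) < \sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{k_{i-1}k_i}\right) < \sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{k_{i-1}^2}\right). \tag{121}$$


In other words, though the actual value of the circumference lies in between the
values given by (120) and (119), what is being said is that there will not be much
difference if we divide by the square of either of the karnas rather than by the
product of two successive ones. Actually, the difference between (120) and (119)
is given by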
Using (114) and (123) in (120), we obtain:

[...]

Each of the terms in (126) is a sum of results (phala-yoga) which we need to
estimate when $n$ is very large, and we have a series of them (phala-parampara)
which are alternately positive and negative. Clearly the first term is just the sum
[...] integral multiples of bhuja-khanda, [...] expression for the circumference
given above, [...]
Now, recalling the estimates that were obtained earlier for these sankalitas,
when $n$ is large,

$$S_n^{(k)} \approx \frac{n^{k+1}}{k+1}, \tag{127}$$

we arrive at the result

$$\frac{C}{8} = r\left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots\right), \tag{128}$$

which is the same as (112).
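
The geometrical step behind this result, namely that the sum of the segments $(r/n)\,r^2/(k_{i-1}k_i)$ and the two sums obtained by dividing instead by the square of either karna all approach one-eighth of the circumference, can be illustrated numerically. The following Python sketch is ours and assumes only the definition (114) of the karnas; the function name is hypothetical.

```python
import math


def eighth_circumference_sums(r: float, n: int) -> tuple[float, float, float]:
    """Return (lower, middle, upper) sums over i = 1..n.

    lower  divides by k_i^2,
    middle divides by k_{i-1} * k_i,
    upper  divides by k_{i-1}^2,
    where k_i^2 = r^2 + (i r / n)^2 as in (114).
    """
    k = [math.sqrt(r * r + (i * r / n) ** 2) for i in range(n + 1)]
    bit = r / n  # one bhuja-khanda
    lower = sum(bit * r * r / k[i] ** 2 for i in range(1, n + 1))
    middle = sum(bit * r * r / (k[i - 1] * k[i]) for i in range(1, n + 1))
    upper = sum(bit * r * r / k[i - 1] ** 2 for i in range(1, n + 1))
    return lower, middle, upper


if __name__ == "__main__":
    r = 1.0
    for n in (10, 100, 1000):
        lower, middle, upper = eighth_circumference_sums(r, n)
        # One-eighth of the circumference 2*pi*r is pi*r/4.
        print(n, lower, middle, upper, math.pi * r / 4)
```

The gap between the upper and lower sums telescopes to $r/(2n)$, so all three agree in the limit of large $n$.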


14. Derivation of end-correction terms (Antya-samskara)

It is well known that the series given by (112) for $\pi/4$ is an extremely slowly
converging series. The error after $N$ terms of the series for π is roughly $1/N$, so
even for obtaining the value of π correct to 2 decimal places one has to find the
sum of hundreds of terms, and for getting it correct to 4-5 decimal places we need
to consider tens or hundreds of thousands of terms. Madhava
seems to have found an ingenious way to circumvent this problem. The technique
employed by Madhava is known as antya-samskara. The nomenclature stems
from the fact that a correction (samskara) is applied towards the end (anta) of the
series, when it is terminated after considering only a certain number of terms from
the beginning.


14.1. The criterion for antya-samskara to yield accurate result


The discussion on antya-samskara in both Yuktibhasa and Kriyakramakari
commences with the question:


How is it that one obtains the value of the circumference more accurately by doing
antya-samskara, instead of repeatedly dividing by odd numbers?


262 K. Ramasubramanian and M. D. Srinivas 


The argument adduced in favor of terminating the series at any desired term, still
ensuring the accuracy, is as follows. Let the series for $\pi/4$ be written as

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots + (-1)^{\frac{p-3}{2}}\frac{1}{p-2} + (-1)^{\frac{p-1}{2}}\frac{1}{a_{p-2}}, \tag{129}$$

where $\frac{1}{a_{p-2}}$ is the correction term applied after odd denominator $p - 2$. On the
other hand, if the correction term $\frac{1}{a_p}$ is applied after the odd denominator $p$, then

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots + (-1)^{\frac{p-3}{2}}\frac{1}{p-2} + (-1)^{\frac{p-1}{2}}\frac{1}{p} + (-1)^{\frac{p+1}{2}}\frac{1}{a_p}. \tag{130}$$


If the correction terms indeed lead to the exact result, then both the series (129)
and (130) should yield the same result. That is,

$$\frac{1}{a_{p-2}} = \frac{1}{p} - \frac{1}{a_p} \quad\text{or}\quad \frac{1}{a_{p-2}} + \frac{1}{a_p} = \frac{1}{p} \tag{131}$$

is the criterion that must be satisfied for the end-correction (antya-samskara) to
lead to the exact result.


14.2. Successive approximations to get more accurate correction-terms


The criterion given by (131) is trivially satisfied when we choose $a_{p-2} = a_p = 2p$.
However, this value $2p$ cannot be assigned to both the correction-divisors⁸⁴
$a_{p-2}$ and $a_p$ because both the corrections should follow the same rule. That is,

$$a_{p-2} = 2p \Rightarrow a_p = 2(p+2), \quad\text{or}\quad a_p = 2p \Rightarrow a_{p-2} = 2(p-2).$$

We can, however, have both $a_{p-2}$ and $a_p$ close to $2p$ by taking $a_{p-2} = 2p - 2$
and $a_p = 2p + 2$ [...] double the even number above the last odd-number divisor $p$:

$$a_p = 2(p+1). \tag{132}$$


⁸⁴ By the term correction-divisor [...] is meant the divisor of the correction term [...].
[...] a measure of inaccuracy (sthaulya) $E(p)$ is introduced:

$$E(p) = \left[\frac{1}{a_{p-2}} + \frac{1}{a_p}\right] - \frac{1}{p}. \tag{133}$$


Now, since the error cannot be eliminated, the objective is to find the correction
denominators $a_p$ such that the inaccuracy $E(p)$ is minimised. When we set
$a_p = 2(p + 1)$, the inaccuracy will be

$$E(p) = \frac{1}{(2p-2)} + \frac{1}{(2p+2)} - \frac{1}{p} = \frac{1}{(p^3 - p)}. \tag{134}$$

This estimate of the inaccuracy, $E(p)$, being positive, shows that the correction has
been overdone and hence there has to be a reduction in the correction. This means
that the correction-divisor has to be increased. If we take $a_p = 2p + 3$, thereby
leading to $a_{p-2} = 2p - 1$, we have

$$E(p) = \frac{1}{(2p-1)} + \frac{1}{(2p+3)} - \frac{1}{p} = \frac{(-2p + 3)}{(4p^3 + 4p^2 - 3p)}. \tag{135}$$


Now, the inaccuracy happens to be negative. But, more importantly, it has a term
proportional to $p$ in the numerator. Hence, for large $p$, $E(p)$ given by (135) varies
inversely as $p^2$, while for the divisor given by (132), $E(p)$ as given by (134) varies
inversely as $p^3$.

From (134) and (135) it is obvious that, if we want to reduce the inaccuracy and
thereby obtain a better correction, then a number less than 1 has to be added to the
correction-divisor (132) given above. If we try adding [...] divided by the [...]
higher order contributions, we need an extra [...] we take the correction divisor to be

$$a_p = (2p+2) + \frac{4}{(2p+2)} = \frac{(2p+2)^2 + 4}{(2p+2)}. \tag{136}$$
Then, correspondingly, we have

$$a_{p-2} = (2p-2) + \frac{4}{(2p-2)} = \frac{(2p-2)^2 + 4}{(2p-2)}. \tag{137}$$

We can then calculate the inaccuracy to be

$$\begin{aligned}
E(p) &= \frac{1}{(2p-2) + \frac{4}{2p-2}} + \frac{1}{(2p+2) + \frac{4}{2p+2}} - \frac{1}{p} \\
&= \frac{4p^3}{(4p^4 + 16)} - \frac{(16p^4 + 64)}{4p(4p^4 + 16)} \\
&= \frac{-4}{(p^5 + 4p)}.
\end{aligned} \tag{138}$$
B (4p) 
n [ t 
Clearly, the sthaulya with this (second order) correction divisor has improved
considerably, in that it is now proportional to the inverse fifth power of the odd
number.⁸⁶
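
The three values of the sthaulya obtained above can be verified with exact rational arithmetic. The Python sketch below is an illustration only; the function names are ours, and each correction-divisor rule is written as a function of the odd number $p$ so that the same rule is applied at $p - 2$ and at $p$.

```python
from fractions import Fraction


def sthaulya(divisor, p: int) -> Fraction:
    """E(p) = 1/a_{p-2} + 1/a_p - 1/p for a given correction-divisor rule."""
    return 1 / divisor(p - 2) + 1 / divisor(p) - Fraction(1, p)


def first_order(p: int) -> Fraction:
    """a_p = 2p + 2, the divisor (132)."""
    return Fraction(2 * p + 2)


def shifted(p: int) -> Fraction:
    """a_p = 2p + 3, the trial divisor leading to (135)."""
    return Fraction(2 * p + 3)


def second_order(p: int) -> Fraction:
    """a_p = (2p + 2) + 4 / (2p + 2), the divisor (136)."""
    return Fraction(2 * p + 2) + Fraction(4, 2 * p + 2)


if __name__ == "__main__":
    for p in range(3, 41, 2):
        assert sthaulya(first_order, p) == Fraction(1, p**3 - p)
        assert sthaulya(shifted, p) == Fraction(-2 * p + 3, 4 * p**3 + 4 * p**2 - 3 * p)
        assert sthaulya(second_order, p) == Fraction(-4, p**5 + 4 * p)
    print("sthaulya formulas (134), (135), (138) agree for odd p up to 39")
```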
At this stage, we may display the result obtained for the circumference with the
correction term as follows. If only the first order correction (132) is employed, we
have
The verse due to Madhava that we cited earlier as defining the infinite series for
$\pi/4$ is, in fact, the first of a group of four verses that present the series along with
the above end-correction.⁸⁷


[Sanskrit text]


The diameter multiplied by four and divided by unity. Again the products of 
the diameter and four are divided by the odd numbers like three, five, etc., and the 
results are subtracted and added in order. 


Take half of the succeeding even number as the multiplier at whichever [odd]
number the division process is stopped, because of boredom. The square of that 
[even number] added to unity is the divisor. Their ratio has to be multiplied by 
the product of the diameter and four as earlier. 


The result obtained has to be added if the earlier term [in the series] has been 
subtracted and subtracted if the earlier term has been added. The resulting cir- 
cumference is very accurate; in fact more accurate than the one which may be 
obtained by continuing the division process [with large number of terms in the 
series]. 
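

The procedure described in these verses can be followed step by step in a short program. The Python sketch below is our own illustration (the function names are hypothetical): it sums the series up to a chosen odd divisor $p$, then applies the end-correction with multiplier half the succeeding even number and divisor the square of that even number plus one, adding it if the last term was subtracted and subtracting it otherwise.

```python
import math


def circumference_plain(d: float, p: int) -> float:
    """4d(1 - 1/3 + 1/5 - ...) stopped at the odd divisor p, no correction."""
    total, sign = 0.0, 1.0
    for q in range(1, p + 1, 2):
        total += sign * 4 * d / q
        sign = -sign
    return total


def circumference_with_correction(d: float, p: int) -> float:
    """Same partial sum, followed by the end-correction of the verses."""
    total, sign = 0.0, 1.0
    for q in range(1, p + 1, 2):
        total += sign * 4 * d / q
        sign = -sign
    even = p + 1                                   # succeeding even number
    correction = 4 * d * (even / 2) / (even**2 + 1)
    # After the loop, `sign` is opposite to the sign of the last term used,
    # so the correction is added if that term was subtracted, and vice versa.
    return total + sign * correction


if __name__ == "__main__":
    for p in (3, 11, 29):
        print(p,
              abs(circumference_plain(1.0, p) - math.pi),
              abs(circumference_with_correction(1.0, p) - math.pi))
```

With the diameter taken as 1 and the division stopped at 29, the corrected value differs from π by less than one part in a million, whereas the uncorrected partial sum is still off in the second decimal place.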


Continuing this process further, Yuktibhasa presents the next order correction-term
which is said to be even more accurate:⁸⁸


At the end, [i.e., after terminating the series a[...] term with] the multiplier
being square of half of [...]

[...] In other words,⁸⁹

$$\frac{1}{a_p} = \frac{\left(\frac{p+1}{2}\right)^2 + 1}{\left((p+1)^2 + 4 + 1\right)\left(\frac{p+1}{2}\right)} = \frac{1}{(2p+2) + \dfrac{4}{(2p+2) + \dfrac{16}{2p+2}}}. \tag{141}$$

Hence, a much better approximation for π is:⁹⁰

⁸⁹ The inaccuracy or sthaulya associated with this correction can be calculated to be

$$E(p) = \frac{2304}{(64p^7 + 448p^5 + 1792p^3 - 2304p)}.$$

⁹⁰ [...] C. Rajagopal and M. S. Rangachari, 'On an [...] for Hist. Sc. 35(2), 89-102, 1978), that the
[...] following continued fraction [...]
15. Transforming the Madhava series for better convergence


After the estimation of end-correction terms, Yuktibhasa goes on to outline a
method of transforming the Madhava series (by making use of the above end-correction
terms) to obtain new series that have much better convergence properties.
We now reproduce the following from the English translation of Yuktibhasa:⁹¹

Therefore, the circumference (of a circle) can be derived in taking into consideration
what has been stated above. A method for that is stated in the verse


The fifth powers of the odd numbers (1, 3, 5 etc.) are increased by 
4 times themselves. The diameter is multiplied by 16 and it is suc- 
cessively divided by the (series of) numbers obtained (as above). The 
odd (first, third etc.) quotients obtained are added and are subtracted 
from the sum of the even (the second, fourth etc.) quotients. The 
result is the circumference corresponding to the given diameter. 

Herein above is stated a method for deriving the circumference. If the cor- 
rection term is applied to an approximate circumference and the amount of inac- 
curacy (sthaulya) is found, and if it is additive, then the result is higher. Then 
it will become more accurate when the correction term obtained from the next 
higher odd number is subtracted. Since it happens that (an approximate circum- 
ference) becomes more and more accurate by making corrections in succeeding 
terms, if the corrections are applied right from the beginning itself, then the cir- 
cumference will come out accurate. This is the rationale for this (above-stated 
result). 

When it is presumed that the correction-divisor is just double the odd number, 
the following is a method to obtain the (accurate) circumference by a correction 
for the corresponding inaccuracy (sthaulyam$a-parihara), which is given by 
the verse: 


[Sanskrit text]

The diameter is multiplied by 4 and is divided, successively, [...]
cubes of the odd [...] which are d[...]

[...] one, are the divisors for four times the diameter. (Make the several
divisions). The quotients got by (the division) are alternately added
to or subtracted from twice the diameter. In the end, divide four times
the diameter by twice the result of squaring the odd number following
the last even number to which is added 2.

The method of sthaulya-parihara, outlined above, essentially involves incorporating
the correction terms into the series from the beginning itself. Let us recall
that inaccuracy or sthaulya at each stage is given by

$$E(p) = \frac{1}{a_{p-2}} + \frac{1}{a_p} - \frac{1}{p}. \tag{143}$$
The series for the circumference (112) can be expressed in terms of these sthaulyas
as follows:

$$C = 4d\left[\left(1 - \frac{1}{a_1}\right) + E(3) - E(5) + E(7) - \cdots\right]. \tag{144}$$


Now, by choosing different correction-divisors $a_p$ in (144), we get several
transformed series which have better convergence properties. If we consider the
correction-divisor (136), then using the expression (138) for the sthaulyas, we get

$$C = 16d\left[\frac{1}{(1^5 + 4 \cdot 1)} - \frac{1}{(3^5 + 4 \cdot 3)} + \frac{1}{(5^5 + 4 \cdot 5)} - \cdots\right].$$

[...] verse samapancahatayoh ... Note that each [...] fifth power of the odd number
in the denominator [...] more accurate results with fewer [...]

[...] (132) and the associated sthaulya [...] series is the [...]
Even if we take non-optimal correction-divisors, we often end up obtaining
interesting series. For instance, if we take a non-optimal correction-divisor, say of
the form $a_p = 2p$, then the sthaulya is given by

$$E(p) = \frac{1}{(2p-4)} + \frac{1}{2p} - \frac{1}{p} = \frac{1}{(p^2 - 2p)} = \frac{1}{(p-1)^2 - 1}. \tag{147}$$


Then, the transformed series will be the one given in the verse dvyadiyujam
va krtayo...:

$$C = 4d\left[\frac{1}{2} + \frac{1}{(2^2 - 1)} - \frac{1}{(4^2 - 1)} + \frac{1}{(6^2 - 1)} - \cdots\right]. \tag{148}$$
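

To see how much the transformation helps, one can compare partial sums of the original series with those of the two transformed series described in the verses: the one whose divisors are the fifth powers of the odd numbers increased by four times themselves (with the diameter multiplied by 16), and the one whose divisors are the squares of the even numbers diminished by one (added to and subtracted from twice the diameter). The Python sketch below is our own illustration with hypothetical function names; the number of terms is arbitrary.

```python
import math


def original_series(d: float, terms: int) -> float:
    """4d(1 - 1/3 + 1/5 - ...), using the given number of terms."""
    return 4 * d * sum((-1) ** j / (2 * j + 1) for j in range(terms))


def fifth_power_series(d: float, terms: int) -> float:
    """16d(1/(1^5 + 4*1) - 1/(3^5 + 4*3) + 1/(5^5 + 4*5) - ...)."""
    total = 0.0
    for j in range(terms):
        p = 2 * j + 1
        total += (-1) ** j / (p**5 + 4 * p)
    return 16 * d * total


def even_square_series(d: float, terms: int) -> float:
    """4d(1/2 + 1/(2^2 - 1) - 1/(4^2 - 1) + 1/(6^2 - 1) - ...)."""
    total = 0.5
    for m in range(1, terms + 1):
        total += (-1) ** (m + 1) / ((2 * m) ** 2 - 1)
    return 4 * d * total


if __name__ == "__main__":
    for name, series in (("original", original_series),
                         ("even squares", even_square_series),
                         ("fifth powers", fifth_power_series)):
        print(name, abs(series(1.0, 10) - math.pi))
```

With ten terms each, the error falls from about one-tenth for the original series to below one-hundredth for the even-square series and to a few millionths for the fifth-power series.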


16. Derivation of the Madhava series for Rsine and Rversine

16.1. First and second order differences of Rsines


We shall now outline the derivation of Madhava series for Rsine (bhuja-jya) and
Rversine (sara), as given in Yuktibhasa.⁹³ Yuktibhasa begins with a discussion
of the first and second order Rsine-differences and derives an exact form of the
result of Aryabhata that the second-order Rsine-differences are proportional to
the Rsines themselves. We had briefly indicated this proof in Section 5.3.
Here we are interested in obtaining the Madhava series for the jya and sara of
an arc [...] EC [...]. This arc is divided into [...]
The corresponding koti-jya $K_j$, and the sara $S_j$, are given by

$$K_j = \text{koti}\left(\frac{js}{n}\right) = R\cos\left(\frac{j\theta}{n}\right), \tag{150}$$

$$S_j = \text{sara}\left(\frac{js}{n}\right) = R\left[1 - \cos\left(\frac{j\theta}{n}\right)\right]. \tag{151}$$


Now, $C_jC_{j+1}$ represents the $(j+1)$-th arc bit. Then, for the arc $EC_j = js/n$,
its pinda-jya is $B_j = C_jP_j$, and the corresponding koti-jya and sara are
$K_j = C_jT_j$, $S_j = EP_j$. Similarly we have

$$B_{j+1} = C_{j+1}P_{j+1}, \quad K_{j+1} = C_{j+1}T_{j+1} \quad\text{and}\quad S_{j+1} = EP_{j+1}. \tag{152}$$


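FIGURE 9. [...] Jya and Sara by Sankalitas.


Let $M_{j+1}$ be the mid-point of the arc-bit $C_jC_{j+1}$ and similarly $M_j$ the mid-point
of the previous arc-bit. We shall denote the pinda-jya of the arc $EM_{j+1}$ as
$B_{j+\frac12}$, so that $B_{j+\frac12} = M_{j+1}Q_{j+1}$. The corresponding koti-jya and sara are
$K_{j+\frac12} = M_{j+1}U_{j+1}$ and $S_{j+\frac12} = EQ_{j+1}$.

Similarly,

$$B_{j-\frac12} = M_jQ_j, \quad K_{j-\frac12} = M_jU_j \quad\text{and}\quad S_{j-\frac12} = EQ_j. \tag{153}$$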


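Let $a$ be the chord corresponding to the equal arc-bits $s/n$ as indicated in Figure 9.
That is, $C_jC_{j+1} = M_jM_{j+1} = a$. Let $F$ be the intersection of $C_jT_j$
and $C_{j+1}P_{j+1}$, and $G$ of $M_jU_j$ and $M_{j+1}Q_{j+1}$. The triangles $C_{j+1}FC_j$ and
$OQ_{j+1}M_{j+1}$ are similar, as their sides are mutually perpendicular. Thus we have

$$\frac{C_{j+1}C_j}{OM_{j+1}} = \frac{C_{j+1}F}{OQ_{j+1}} = \frac{FC_j}{Q_{j+1}M_{j+1}}. \tag{154}$$


Hence we obtain

$$B_{j+1} - B_j = \left(\frac{a}{R}\right) K_{j+\frac12}, \tag{155}$$

$$K_j - K_{j+1} = S_{j+1} - S_j = \left(\frac{a}{R}\right) B_{j+\frac12}. \tag{156}$$


Similarly, the triangles $M_{j+1}GM_j$ and $OP_jC_j$ are similar and we get

$$\frac{M_{j+1}M_j}{OC_j} = \frac{M_{j+1}G}{OP_j} = \frac{GM_j}{P_jC_j}. \tag{157}$$

Thus we obtain

$$B_{j+\frac12} - B_{j-\frac12} = \left(\frac{a}{R}\right) K_j, \tag{158}$$

$$K_{j-\frac12} - K_{j+\frac12} = S_{j+\frac12} - S_{j-\frac12} = \left(\frac{a}{R}\right) B_j. \tag{159}$$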
From (159) and (161), we also get the second order Rsine-differences (the differences
of the Rsine-differences called khanda-jyantara):

$$\begin{aligned}
\Delta_j - \Delta_{j+1} &= (B_j - B_{j-1}) - (B_{j+1} - B_j) \\
&= \left(\frac{a}{R}\right)\left(K_{j-\frac12} - K_{j+\frac12}\right) \\
&= \left(\frac{a}{R}\right)\left(S_{j+\frac12} - S_{j-\frac12}\right) \\
&= \left(\frac{a}{R}\right)^2 B_j.
\end{aligned} \tag{162}$$

Now, if the sum of the second-order Rsine-differences is subtracted from the first
Rsine-difference, then we get any desired Rsine-difference. That is

$$\Delta_1 - [(\Delta_1 - \Delta_2) + (\Delta_2 - \Delta_3) + \cdots + (\Delta_{j-1} - \Delta_j)] = \Delta_j. \tag{163}$$

From (162) and (163) we conclude that

$$\Delta_1 - \left(\frac{a}{R}\right)^2 (B_1 + B_2 + \cdots + B_{j-1}) = \Delta_j. \tag{164}$$

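16.2. Rsines and Rversines from Jya-sankalita


We can sum up the Rversine-differences (159), to obtain the sara, Rversine, at the
mid-point of the last arc-bit as follows:

$$\begin{aligned}
S_{n-\frac12} - S_{\frac12} &= \left(S_{n-\frac12} - S_{n-\frac32}\right) + \cdots + \left(S_{\frac32} - S_{\frac12}\right) \\
&= \left(\frac{a}{R}\right)(B_{n-1} + B_{n-2} + \cdots + B_1).
\end{aligned} \tag{165}$$


[...] (165) can also be expressed as a summation of [...]

$$\begin{aligned}
B_n &= n\Delta_1 - \left(\frac{a}{R}\right)^2 [(B_1 + B_2 + \cdots + B_{n-1}) + (B_1 + B_2 + \cdots + B_{n-2}) + \cdots + B_1] \\
&= nB_1 - \left(\frac{a}{R}\right)^2 [B_{n-1} + 2B_{n-2} + \cdots + (n-1)B_1].
\end{aligned} \tag{167}$$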


The results (158)-(167), obtained so far, involve no approximations. It is now
shown how better and better approximations to the Rsine and Rversine can be
obtained by taking $n$ to be very large or, equivalently, the arc-bit $s/n$ to be very
small. Then, we can approximate the full-chord and the Rsine of the arc-bit by the
arc-bit $s/n$ itself. Also, as a first approximation, we can approximate the pinda-jyas
$B_j$ in the equations (164), (165) or (167) by the corresponding arcs themselves.
That is

$$B_j \approx \frac{js}{n}. \tag{168}$$


The result for the Rsine obtained this way is again used to obtain a better approximation
for the pinda-jyas $B_j$ which is again substituted back into the equations
[...] led to estimate the sums and repeated sums of natural numbers
([...]sankalita), when the number of terms is very large.


As we noted earlier, the relations giv[...]
shall show how better and better a[...]
[...] the above relations (165), (167) become

$$S = S_n \approx \left(\frac{1}{R}\right)\left(\frac{s}{n}\right)(B_{n-1} + B_{n-2} + \cdots + B_1), \tag{169}$$

$$B = B_n \approx s - \left(\frac{1}{R}\right)^2\left(\frac{s}{n}\right)^2 [(B_1 + B_2 + \cdots + B_{n-1}) + (B_1 + B_2 + \cdots + B_{n-2}) + \cdots + B_1], \tag{170}$$

where $B$ and $S$ are the Rsine and Rversine of the desired arc of length $s$, and the
results will be more accurate the larger the value of $n$.


Now, as a first approximation, we take each pinda-jya B_j in (169) and (170) to 
be equal to the corresponding arc itself, that is 

\[
B_j \approx \frac{js}{n}. \qquad (171)
\]

Then we obtain for the Rversine 

\[
S \approx \left(\frac{1}{R}\right)\left(\frac{s}{n}\right)^{2}\left[(n-1) + (n-2) + \cdots\right]. \qquad (172)
\]


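For large n, we can use the estimate (89) for the sum of integers. Hence (172) 
reduces to 

\[
S \approx \left(\frac{1}{R}\right)\frac{s^{2}}{2}. \qquad (173)
\]

Similarly, substituting (171) in (170), we obtain for the Rsine 

\[
B \approx s - \left(\frac{1}{R}\right)^{2}\left(\frac{s}{n}\right)^{3}\left[(1 + 2 + \cdots + (n-1)) + (1 + 2 + \cdots + (n-2)) + \cdots\right]. \qquad (174)
\]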


... in the following derivation, instead of using ... 
pinda-jyas, ... use the ... 
The second term in (174) is a dvitiya-sankalita, the second order repeated sum, 
and using the estimate (108), we obtain 

\[
B \approx s - \left(\frac{1}{R}\right)^{2}\frac{s^{3}}{1\cdot 2\cdot 3}. \qquad (175)
\]


Thus we see that the first correction obtained in (175) to the Rsine-arc-difference 
(jya-capantara-samskara) is equal to the earlier correction to the Rversine 
(sara-samskara) given in (173) multiplied by the arc and divided by the radius and 3. 
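
For convenience, the two limiting steps used above can be written out explicitly. This is a restatement in modern notation, on the assumption that the estimates (89) and (108) referred to are the large-n values n²/2 and n³/6 of the first and second order repeated sums; those equations are not reproduced in this section.

```latex
% (172) -> (173): first-order sum, 1 + 2 + ... + (n-1) ~ n^2/2 for large n
S \approx \frac{1}{R}\left(\frac{s}{n}\right)^{2}\sum_{j=1}^{n-1} j
  \approx \frac{1}{R}\left(\frac{s}{n}\right)^{2}\frac{n^{2}}{2}
  = \frac{1}{R}\,\frac{s^{2}}{2}.

% (174) -> (175): second-order repeated sum,
% sum_{m=1}^{n-1} m(m+1)/2 = (n-1)n(n+1)/6 ~ n^3/6 for large n
B \approx s - \frac{1}{R^{2}}\left(\frac{s}{n}\right)^{3}
      \sum_{m=1}^{n-1}\sum_{j=1}^{m} j
  \approx s - \frac{1}{R^{2}}\left(\frac{s}{n}\right)^{3}\frac{n^{3}}{6}
  = s - \frac{1}{R^{2}}\,\frac{s^{3}}{1\cdot 2\cdot 3}.
```

Dividing the second correction by the first gives s/(3R), which is the rule stated in words above.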

It is noted that the results (173) and (175) are only approximate (prayika), 
since, instead of the sankalita of the pinda-jyas in (169) and (170), we have only 
carried out sankalita of the arc-bits. Now that (175) gives a correction to the 
difference between the Rsine and the arc (jya-capantara-samskara), we can use 
that to correct the values of the pinda-jyas and thus obtain the next corrections to 
the Rversine and Rsine. 

Following (175), the pinda-jyas may now be taken as 

\[
B_j \approx \frac{js}{n} - \left(\frac{1}{R}\right)^{2}\frac{\left(\frac{js}{n}\right)^{3}}{1\cdot 2\cdot 3}. \qquad (176)
\]


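If we introduce (176) in (169), we obtain 

\[
S \approx \left(\frac{1}{R}\right)\left(\frac{s}{n}\right)^{2}\left[(n-1) + (n-2) + \cdots\right]
 - \left(\frac{1}{R}\right)^{3}\left(\frac{s}{n}\right)^{4}\left(\frac{1}{1\cdot 2\cdot 3}\right)\left[(n-1)^{3} + (n-2)^{3} + \cdots\right]. \qquad (177)
\]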


The first term in (177) was already evaluated while deriving (173). The second 
term in (177) can either be estimated as a summation of cubes (ghana-sankalita), 
or as a trtiya-sankalita, third order (repeated) summation, because each individual 
term there has been obtained by doing a second-order (repeated) summation. 
Hence, recollecting our earlier estimate (110) for these sankalitas, we get 

\[
S \approx \left(\frac{1}{R}\right)\frac{s^{2}}{2} - \left(\frac{1}{R}\right)^{3}\frac{s^{4}}{1\cdot 2\cdot 3\cdot 4}. \qquad (178)
\]

Equation (178) gives a correction (sara-samskara) to the earlier value (173) of 
the Rversine, which is nothing but the earlier correction to the Rsine-arc difference 
(jya-capantara-samskara) given in (175) multiplied by the arc and divided by the 
radius and 4. 
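Again, if we use the corrected pinda-jyas (176) in the expression (170) for the 
Rsine, we obtain 

\[
\begin{aligned}
B \approx{} & s - \left(\frac{1}{R}\right)^{2}\left(\frac{s}{n}\right)^{3}\left[(1 + 2 + \cdots + (n-1)) + (1 + 2 + \cdots + (n-2)) + \cdots\right] \\
 & + \left(\frac{1}{R}\right)^{4}\left(\frac{s}{n}\right)^{5}\left(\frac{1}{1\cdot 2\cdot 3}\right)\left[(1^{3} + 2^{3} + \cdots + (n-1)^{3}) + (1^{3} + 2^{3} + \cdots + (n-2)^{3}) + \cdots\right] \\
 \approx{} & s - \left(\frac{1}{R}\right)^{2}\frac{s^{3}}{1\cdot 2\cdot 3} + \left(\frac{1}{R}\right)^{4}\frac{s^{5}}{1\cdot 2\cdot 3\cdot 4\cdot 5}. \qquad (179)
\end{aligned}
\]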

The above process can be repeated to obtain successive higher order corrections 
for the Rversine and Rsine: by first finding a correction (jya-capantara-samskara) 
for the difference between the Rsine and the arc, using this correction 
to correct the pinda-jyas B_j, and using them in equations (169) and (170) to get the 
next correction (sara-samskara) for the Rversines, and the next correction 
(jya-capantara-samskara) for the Rsine-arc-difference itself, which is then employed 
to get further corrections iteratively. In this way we are led to the Madhava series 
for jya and sara given by 

\[
B = s - \left(\frac{1}{R}\right)^{2}\frac{s^{3}}{1\cdot 2\cdot 3} + \left(\frac{1}{R}\right)^{4}\frac{s^{5}}{1\cdot 2\cdot 3\cdot 4\cdot 5} - \cdots, \qquad (180)
\]

\[
S = \left(\frac{1}{R}\right)\frac{s^{2}}{1\cdot 2} - \left(\frac{1}{R}\right)^{3}\frac{s^{4}}{1\cdot 2\cdot 3\cdot 4} + \left(\frac{1}{R}\right)^{5}\frac{s^{6}}{1\cdot 2\cdot 3\cdot 4\cdot 5\cdot 6} - \cdots. \qquad (181)
\]
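
The iterative scheme can be imitated numerically. The sketch below is a modern illustration under stated assumptions, not a rendering of the text's own procedure: n is kept finite (so the limit of very large n is only approached), the relation (170) is applied with n replaced by each j so that all the pinda-jyas are updated together, and the function name `madhava_iterates` is hypothetical.

```python
import math
from itertools import accumulate

def madhava_iterates(s, R, n, rounds):
    """Successive approximations to the Rsine of the arc s, following (170).

    Round 0 takes B_j = j*s/n as in (171); every later round re-inserts
    the current pinda-jyas into the repeated (second-order) sum.
    Returns the value obtained for the full arc after each round.
    """
    h = s / n                                   # arc-bit
    B = [j * h for j in range(n + 1)]           # B[0] = 0, B[j] ~ j*s/n
    history = [B[n]]
    for _ in range(rounds):
        first = list(accumulate(B))             # first[m]  = B_1 + ... + B_m
        second = list(accumulate(first))        # second[m] = first[1] + ... + first[m]
        B = [0.0] + [j * h - (h / R) ** 2 * second[j - 1]
                     for j in range(1, n + 1)]
        history.append(B[n])
    return history

# Assumed illustrative values: unit radius, arc of one radian.
for k, value in enumerate(madhava_iterates(1.0, 1.0, n=2000, rounds=4)):
    print(k, value, math.sin(1.0))
```

With these assumed values the first round reproduces s − s³/6 up to a term of order 1/n², and later rounds approach sin(1), mirroring the way each correction feeds the next.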
17. Instantaneous velocity and derivatives 


As we saw in Section 6.1, the mandaphala or the equation of centre for a planet 
Δμ is given by 

\[
\mathrm{Rsin}(\Delta\mu) = \left(\frac{r_0}{R}\right)\mathrm{Rsin}(M - \alpha), \qquad (182)
\]

where r₀ is the mean epicycle radius, M is the mean longitude of the planet and α 
the longitude of the apogee. Further as we noted earlier, Munjala, Aryabhata II 
and Bhaskara II used the approximation 

\[
\mathrm{Rsin}(\Delta\mu) \approx \Delta\mu, \qquad (183)
\]


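in (182) and obtained the following expression as correction to the instantaneous 
velocity of the planet: 

\[
\frac{d}{dt}(\Delta\mu) = \left(\frac{r_0}{R}\right)\mathrm{Rcos}(M - \alpha)\,\frac{d}{dt}(M - \alpha). \qquad (184)
\]

Actually the instantaneous velocity of the planet has to be evaluated from the more 
accurate relation 

\[
\Delta\mu = \mathrm{Rsin}^{-1}\left[\left(\frac{r_0}{R}\right)\mathrm{Rsin}(M - \alpha)\right]. \qquad (185)
\]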


The correct expression for the instantaneous velocity which involves the derivative 
of arc-sine function has been given by Nilakantha in his Tantrasangraha.⁹⁶ 


Let the product of the kotiphala [r₀ cos(M − α)] in minutes and the daily motion 
of the manda-kendra (d(M − α)/dt) be divided by the square root of the square of 
the bahuphala subtracted from the square of trijya (√(R² − r₀² sin²(M − α))). 


The result thus obtained has to be subtracted from the daily motion of the Moon 
if the manda-kendra lies within six signs beginning from Mrga and added if it 


96 Tantrasañgraha, cited above (fn. 52), verses 2.53-54, pp.169-170. Elsewhere, Nilakantha 
has ascribed these verses to his teacher Damodara (Jyotirmīmāmsā, Ed. by K. V. Sarma, VVRI, 
Hoshiarpur 1977, p. 40). 
lies within six signs beginning from Karkataka. The result gives a more accurate 
value of the Moon's angular velocity. In fact, the procedure for finding the 
instantaneous velocity of the Sun is also the same. 


If (M − α) be the manda-kendra, then the content of the above verse can be 
expressed as 

\[
\frac{d}{dt}\left[\sin^{-1}\left(\frac{r_0}{R}\sin(M - \alpha)\right)\right]
 = \frac{r_0\cos(M - \alpha)\,\dfrac{d(M - \alpha)}{dt}}{\sqrt{R^{2} - r_0^{2}\sin^{2}(M - \alpha)}}. \qquad (186)
\]

The instantaneous velocity of the planet is given by 

\[
\frac{d}{dt}\mu = \frac{d}{dt}(M - \alpha)
 - \frac{r_0\cos(M - \alpha)\,\dfrac{d(M - \alpha)}{dt}}{\sqrt{R^{2} - r_0^{2}\sin^{2}(M - \alpha)}}. \qquad (187)
\]

Here, the first term in the RHS represents the mean velocity of the planet and the 
second term the rate of change in the mandaphala given by (186). 

In his Aryabhatiya-bhasya, Nilakantha explains how his result is more correct 
than the traditional result of Munjala and Bhaskaracarya:⁹⁷ 


[Sanskrit passage in Devanagari; illegible in the scan] 


... be divided by ... [but] here 
... kotiphala and the rate of ... of 
... 
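
The expression (186) can be compared with a direct numerical derivative. The following is a minimal sketch, not part of the text: angles are in radians, R is set to 1 so that Rsin and Rcos reduce to the ordinary sine and cosine, the ratio r₀/R = 0.2 and the daily motion 0.23 are arbitrary illustrative assumptions, and all function names are hypothetical.

```python
import math

def mandaphala(theta, r0, R):
    """Quantity differentiated on the left of (186): arcsin((r0/R) sin(theta))."""
    return math.asin((r0 / R) * math.sin(theta))

def rate_exact(theta, theta_dot, r0, R):
    """Right-hand side of (186)."""
    return (r0 * math.cos(theta) * theta_dot
            / math.sqrt(R * R - (r0 * math.sin(theta)) ** 2))

def rate_small_angle(theta, theta_dot, r0, R):
    """Rate obtained when the arcsine is dropped, as in (183) and (184)."""
    return (r0 / R) * math.cos(theta) * theta_dot

def rate_numeric(theta, theta_dot, r0, R, dt=1e-6):
    """Central-difference estimate of the time derivative of the mandaphala."""
    ahead = mandaphala(theta + theta_dot * dt, r0, R)
    behind = mandaphala(theta - theta_dot * dt, r0, R)
    return (ahead - behind) / (2.0 * dt)

R, r0 = 1.0, 0.2        # assumed, illustrative ratio r0/R
theta_dot = 0.23        # assumed daily motion of the manda-kendra (radians per day)
for deg in (10, 45, 80, 120):
    th = math.radians(deg)
    print(deg,
          rate_numeric(th, theta_dot, r0, R),
          rate_exact(th, theta_dot, r0, R),
          rate_small_angle(th, theta_dot, r0, R))
```

The first two columns should agree closely, while the third differs by an amount that grows with r₀/R and with sin(M − α), which is the sense in which the arcsine form is the more accurate one.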
17.1. Acyuta's expression for instantaneous velocity involving the derivative 
of ratio of two functions 


In the third chapter of his Sphutanirnayatantra, Acyuta Pisarati (c. 1550–1621), 
a disciple of Jyesthadeva, discusses various results for the instantaneous 
velocity of a planet depending on the form of equation of centre (manda-samskara). 
He first presents the formula involving the derivative of arc-sine function given by 
Nilakantha (in the name of (manda)-sphutagati) as follows:⁹⁹ 


[Sanskrit verse in Devanagari; illegible in the scan] 


Acyuta also gives the formula for the instantaneous velocity of a planet if one 
were to follow a different model proposed by Munjala for the equation of centre, 
according to which mandaphala is given by 

\[
\Delta\mu = \frac{\left(\frac{r_0}{R}\right)\sin(M - \alpha)}{1 - \left(\frac{r_0}{R}\right)\cos(M - \alpha)} \qquad (188)
\]

instead of (182), where Δμ is small. If one were to use this formula for 
mandaphala for finding the true longitude of the planet, then it may be noted that the 
instantaneous velocity will involve the derivative of the ratio of two functions both 
varying with time. Taking note of this, Acyuta observes:¹⁰⁰ 


The procedure that was prescribed earlier is with reference to the School that 
conceives of the increase and decrease in the circumference of the manda-vrtta 
in accordance with the karna. With reference to the School that conceives of 
increase and decrease only according to the half [of it], now we prescribe the 
appropriate procedure to be adopted. 


Acyuta then proceeds to give the correct expression for the instantaneous velocity 
of a planet in Munjala's model:¹⁰¹ 


99 Sphutanirnayatantra of Acyuta Pisarati, Ed. by K. V. Sarma, VVRI, Hoshiarpur 1974, 
p. 19. 

100 Ibid., p. 20. 

101 Ibid., p. 21. 
Having applied the kotiphala to trijya [positively or negatively depending upon 
the mandakendra], let the square of the dohphala be divided by that. This may 
be added to or subtracted from the kotiphala depending on whether it is Mrgadi 
or Karkyadi. The product of this [result thus obtained] and the daily motion of 
the manda-kendra divided by the kotiphala and applied to trijya will be the 
correction to the daily motion. 


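Thus according to Acyuta, the correction to the mean velocity of a planet in 
order to obtain its instantaneous velocity is given by 

\[
\frac{\left[\left(\frac{r_0}{R}\right)\cos(M - \alpha) + \dfrac{\left(\frac{r_0}{R}\sin(M - \alpha)\right)^{2}}{1 - \left(\frac{r_0}{R}\right)\cos(M - \alpha)}\right]\dfrac{d(M - \alpha)}{dt}}{1 - \left(\frac{r_0}{R}\right)\cos(M - \alpha)}, \qquad (189)
\]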


which is nothing but the derivative of the expression given in (188). 
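
As a check on this remark, the derivative of (188) can be worked out by the quotient rule. The following is a modern restatement, with the abbreviations k = r₀/R and θ = M − α introduced here only for brevity. When cos θ is taken with its sign, the two terms of the numerator are separated by a minus sign; when cos θ is negative and the kotiphala is entered as a positive magnitude, both minus signs turn into plus signs. This appears to be what the verse means by adding or subtracting according to whether the kendra is Mrgadi or Karkyadi.

```latex
\frac{d}{dt}\left[\frac{k\sin\theta}{1 - k\cos\theta}\right]
 = \frac{k\cos\theta\,(1 - k\cos\theta) - (k\sin\theta)^{2}}{(1 - k\cos\theta)^{2}}\,\frac{d\theta}{dt}
 = \frac{k\cos\theta - \dfrac{(k\sin\theta)^{2}}{1 - k\cos\theta}}{1 - k\cos\theta}\,\frac{d\theta}{dt},
\qquad k = \frac{r_0}{R},\quad \theta = M - \alpha .

% With \cos\theta = -c, c > 0 (kotiphala entered as a magnitude):
\frac{d}{dt}\left[\frac{k\sin\theta}{1 + kc}\right]
 = -\,\frac{kc + \dfrac{(k\sin\theta)^{2}}{1 + kc}}{1 + kc}\,\frac{d\theta}{dt}.
```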
Notes on Yuktibhasa: Recursive 
Methods in Indian Mathematics 


P. P. Divakaran 


1. Overview 
1.1. Yuktibhasa 


It has taken a long time for historians of mathematics to move from curiosity to 
uncertain admiration to well-informed scholarly recognition of the brilliance of the 
mathematicians/astronomers who lived and worked in Kerala (on the southwest 
coast of India) from the last quarter of the 14th century CE until the end of the 
16th. As the undertaking of bringing to critical attention the totality of their work 
gathers pace, one thing has already become clear: it can no longer be doubted 
that the high point of their mathematical achievements is the invention of calculus 
and its systematic and sophisticated development for application to trigonometric 
functions. Three personalities have emerged as key figures in this story. The first 
of course is the still shadowy Madhavan (Madhavan Emprantiri, or Madhava in 
Sanskritised form; throughout this article, I employ the Malayalam way of writing 
names from Kerala, with a terminal n), the founder of the school, who is credited 
by his followers with having created much of their strikingly original mathematics 
and whose creative genius pervades everything that they subsequently did. Then 
we have the polymath Nilakanthan (Nilakantha Somayaji), the pivotal link between 
Madhavan and the later generations and the author of a large body of surviving 
work, notably Tantrasamgraha (TS from now on). And the third is Jyesthadevan 
who wrote what can accurately be called the first textbook of calculus, Yuktibhasa 
(YB from now on). YB is a comprehensive account, in its last two chapters, of the 
fundamental principles of integral and differential calculus (in that natural order in 
the Kerala approach to calculus), as well as the relationship between them, and their 
use in the study of trigonometric functions. 

Until recently, TS has served as the urtext for students of the final flaring of 
the long tradition of Indian mathematics and astronomy that the Kerala school 
represents. It is a compendium of the new mathematics of Madhavan and of its use 
in astronomy, in a compact Sanskrit verse format, and it is precisely dated (1500 CE) 
at the midpoint of the relevant period. Within a relatively short period of about half 
a century after its writing, a detailed commentary appeared in two separate works 
by Sankaran (Sankara Variyar), Yuktidipika and Laghuvivrti, providing details of 
several of the constructional and computational rules of TS.¹ YB, though it advertises 
itself in the opening sentence as following TS, is quite a different type of work. Rather 
than being a commentary (vyakhya or bhasya), it is a stand-alone exposition of the 
material covered in TS, segregating (unlike TS itself) the mathematics in a first part 
and its use in the construction of an accurate planetary model in the second. The 
mathematical Part I is all that will concern us here and what the abbreviation YB will 
refer to. It is distinctive in other ways too: at a time when all scientific texts were 
composed in Sanskrit, often in terse verse, it was written in Malayalam prose which 
is anything but terse, it is self-contained, and it provides reasoned justifications 
(yukti, which I will use interchangeably with 'proof' from now on)² for all results 
cited. In other words, YB gives us a portrait of the mathematical culture of its 
time as few other Indian texts of any period do. And since it also cites all earlier 
developments relevant to its concerns, it is an ideal vantage point from which to 
survey the entire landscape of Indian mathematical thought of which the work of 
the Kerala school forms the final summit. An English translation of YB by the late 
K. V. Sarma has recently been published [4], enabling scholars without Malayalam 
to finally get to know it almost at first hand³. The present article has taken advantage 
of its publication by assuming that the reader will have it to hand; I shall cite it 
frequently as [Sarma], along with the relevant section numbers, even though the 
translations of passages that often accompany the citations are my own. 


1 All three texts have been critically edited and published in the original Sanskrit in one volume by 
K. V. Sarma (with an introduction in English) [1]. For detailed accounts in English of some of the questions 
... concerned with, see individual articles in [2]. 

2 This is not to be taken to mean that the two terms refer to identical processes, either logically or, 
... the very different approaches in the Indian and European traditions to the enterprise of acquiring 
knowledge, epistemologically. It will not be misleading to think of yukti as proof by other 
... a universally practised sequence of steps starting with axioms. This point has 
... of much impassioned discussion recently. The reader will find a detached evaluation of ... 

3 ... Malayalam, there is the admirable and ... edition (of Part I) of Rama 
... ar [5] (cited often in the rest of this article as [TA]) with diagrams, 
... parts of the text, now 60 years old and out of print; 
... without exception, owes a debt of gratitude to this meticulous 
... has an excellent account of the details of the calculus work 
... Sarma has chosen to title his translation 
... almost ... 
Indian mathematical reasoning is as striking in its avoidance of certain techniques, 
for instance reductio ad absurdum methods, as for its partiality to certain 
others. This article is about what we can learn from YB and its recapitulations of 
earlier material about one such widely favoured general procedure, that of recur- 
sion. Three broadly distinct manifestations of the general idea of recursion (for the 
sense(s) in which I use this term, see below, section 1.7)⁴ find significant use in YB: 
recursive description, recursive computation and construction and recursive proof. 
The term recursive description does duty also for what in modern logic will be called 
recursive definition — as is well known, Indian thought, and certainly Indian mathe- 
matics, did not much care for ab initio definitions. Moreover, it is not an infrequent 
experience while going through YB to come across recursive characterisations of 
mathematical objects, the coefficients in a power series for example, arising in the 
course of a computation. This, and the fact that Indian proofs are largely construc- 
tive, make it not always easy to disentangle description, computation/construction 
and proof in a given piece of writing. 

Recursive ideas run through Indian mathematics from before the time of 
Aryabhata, notably in the perfecting of the decimal place-value notation for the 
natural numbers. But it is in the intricate reasoning that goes into the development 
of calculus that they finally find their most sophisticated expression, their natural 
home as it were. This has a reasonable explanation. Recursive processes are, gener- 
ally speaking, nonterminating as is the infinitude of numbers; indeed, in the context 
of Sanskrit linguistics, it was recognised very early, presumably by Panini (5th C. 
BCE) himself and certainly by his commentator Patanjali (2nd-1st C. BCE), that 
a science of language based on a finite set of recursive rules applied to a finite set 
of linguistic units has the capacity to describe its structure in all its unbounded 
richness. Patanjali's graphic way of illustrating this principle is worth quoting (see 
Staal [7], p. 40): 


Now if grammatical expressions are taught, must this be done by the recitation 
of each particular word for the understanding of grammatical expressions . .. ? 
No, says the author, this recitation of each particular word is not a means for the 
understanding of grammatical expressions. For we have a tradition which describes 
how Brhaspati addressed Indra during a thousand divine years going over the 
grammatical expressions by enumerating each particular word, and still did not 
attain the end. With Brhaspati as the professor, Indra as the student and a thousand 
divine years as the period of study, the end could not be attained . .. The recitation of 
each particular word, therefore, is not a means for the understanding of grammatical 
expressions. How, then, must grammatical expressions be understood? Some work 
containing general and particular rules must be composed. 


In the realm of mathematics, the discipline of calculus cannot, by the nature 
of its very content, but deal with the infinite and its opposite, the infinitely small. 


4 Numbers in boldface refer to the sections and subsections of the present article. 
It is no surprise then to find that YB accommodates and utilises infinitely iterated 
recursions in more than one role. But, as will become evident by the end of this 
article, not all of the recursive methods it calls upon are in response to the demands 
of the infinitesimal geometry which is the particular avatar that calculus takes in 
YB, nor do they all lead to infinite processes. What they do have in common is the 
constant presence of a certain spirit, that of the decimal place-value number system. 
It is for this reason that the next section 2 is devoted to a retracing of this frequently 
traversed path, ahead of the technically more demanding mathematical aspects in 
the subsequent sections. In the present section 1, which is an overview of the entire 
material, I have tried to avoid overly technical matters to the extent possible so as to 
give a reader who is not inclined to delve into mathematical details a general idea 
of the mode of thinking that animated Indian mathematics. 


1.2. Gestures and Sounds, Ritual and Language 


Going back as far as records and reliable tradition allow, three easily identified land- 
marks in the increasingly productive uses of recursive methods in Indian thought 
can be noted. The first began, probably before the earliest vedic works were com- 
posed, with the rules that govern all vedic rituals and effectively culminated in 
Panini's systematisation of the grammar and syntax of the Sanskrit of his time. 
An extensive ritual such as a yaga performed over several days is built up from its 
component elements in several stages of organisation and sequencing, the whole 
process controlled by strict rules which specify how subunits of the ritual, at various 
levels of organisation, may be combined, repeated, substituted, inserted one into 
another, etc., i.e., how they may be operated upon. That the number of repetitions 
of an operation is also specified we may take to be a performative limitation, not 
one of principle. Very similar rule-based structures are imposed also on the artic- 
ulation of sounds (mantra) that accompany the gestures and actions of the ritual. 
It would seem then that a ritual is fundamentally a ‘syntactically’ defined object, 
perhaps serving certain functional ends, its 'semantic' significance being secondary; 
in other words, we are talking about "rules (largely) without meaning". One may 
even argue, more generally, that divorcing sense from symbol bestows freedom; that 
attaching meaning to each component part of a recursively constructed object or 
process, as opposed to an overall purpose, might well hinder the realisation, through 
a process of abstraction, of the full power and flexibility afforded by rules without 


meaning. Nothing illustrates the power of rule-based abstraction more dramatically 
than modern mathematics. 


5 It will be evident to the informed reader that my understanding of Indian rule-based thinking and 
its impact on mathematics has been strongly influenced by the work of Frits Staal. 

6 A faithful and comprehensive record of an actual performance of a complex ritual is available in 
Staal's magnificent work Agni [8]. 
As for Panini, the quotation above from Patanjali says it all, or virtually all, 
about the indispensability of having a finite set of rules which may be applied 
over and over again without bound as the only means of imposing structure and 
regularity on a natural language. It needs to be said that Panini's principles cover all 
aspects of Sanskrit as a language-in-use, their power and depth already evident at 
the elementary level of the small collection of sandhi rules which describe/prescribe 
the way words (primarily their sounds; meaning is hardly relevant) are combined 
and in the process transformed. It is possible, though not of great significance for 
us, that these and other rules were a conscious adaptation, from the ritual syntax that 
apparently pays little heed to meaning, to the setting of a natural language which 
cannot possibly dispense with meaning at a fairly low level of organisation, that 
of a sensible word. The other difference is that, as Patanjali's fable highlights, a 
language does not have any finiteness limitation. 


1.3. One Thing after Another... 


The second landmark is without doubt the adoption of the place-value system 
with 10 as base for the naming, writing and manipulation of integral numbers. 
It is fairly certain that this entailed a long drawn out evolution, beginning with 
the first mentions of number names in the earliest texts⁷ and arriving at a mature, 
economical and consistent model for numbers by the time of Brahmagupta (7th 
C. CE). Along the way, all the features that mark what the Arab mathematicians 
called the Hindu numbers became incorporated: the role of zero both as a numeral, 
the one preceding 1, and as the cardinality of the empty set as in the place-value 
notation; the systematic ordering and naming of numbers going up to values beyond 
any practical utility, making the idea of the infinite almost palpable as one might 
say; fractions and the rules for operating with them; negative numbers and their 
arithmetic properties, and so on. It is this finished product that was transmitted 
to Europe from the Arab world, beginning in the 12th-13th century, by Leonardo 
of Pisa (Fibonacci) and possibly others. The fascination that the 'discovery of the 
zero' has held for European historians has perhaps tended to turn our attention away 
from the role of these other elements in making the decimal system the universally 
adopted basis for human numeracy. Things are beginning to change now. (For the 
early history of the zero, see Staal [10] and for the impact that a proper understanding 
and use of negatives had on Indian arithmetic and geometry, see Mumford [11], 
both in this volume). 


7 A succinct historical account will be found in the book of Datta and Singh [9] (Vol. I). Citing 
instances from the Rgveda in the well known Vedic Index of Macdonell and Keith, it says on page 9: 
"We can definitely say that from the very earliest known times, ten has formed the basis of numeration 
in India". 
It is often assumed that the idea of a place-value number system came to India 
from the Babylonian sexagesimal numbers. The reasons for the belief are, firstly, 
that the sexagesimal system had attained a degree of completion by about 1800 BCE, 
some centuries before the first vedic mentions of numbers in the Rgveda. Then, the 
decimal notation itself is foreshadowed in the Babylonian use of a single character 
for 10 in the writing of sexagesimal numbers. Finally there is the prevalence in 
Indian astronomy of successive divisions by 60 in measuring time as well as the 
circumference of the circle. There may well be an element of validity in each of 
these reasons but, given our poor knowledge of the relevant history, none can be 
taken to be a decisive argument. For instance, the astronomical use of 60 can only 
be dated back to Aryabhata (born 476 CE) who, on the time scale of interest here, 
is a late figure. It is perhaps not an accident that the advent of 60 in the study of the 
(celestial) sphere and the measurement of time coincides with the sudden blooming 
of astronomy associated with him, plausibly through contact with the Alexandria 
of Ptolemy, and had no influence on a pre-existing decimal system — nothing non- 
astronomical has ever been counted in units of 60 in India. The prudent conclusion 
has to be that we know too little about the historical factors bearing on the issue 
to take a position, keeping in mind the risk of mistaking causality (the principle 
that says that the past cannot be influenced by, but may influence, the present) for 
causation. I shall return to the question of the uncertain history later (section 2.4). 
Whatever its antecedents, the decimal system is the first purely mathematical ex- 
pression of the power of recursive construction in the Indian context. Its significance 
for all mathematics, indeed for virtually all human activity, and for all time to come, 
can only be described as seminal. Even though it took some four centuries after its 
introduction by Fibonacci for it to definitively displace the laborious methods of 
doing arithmetic practised by European savants, the speed of its subsequent spread 
was rapid. By the end of the 18th century, Laplace was writing enthusiastically about 
it, as in the oft-quoted tribute: "The ingenious method of expressing every possible 
number using a set of ten symbols (each symbol having a place value and an absolute 
value) ... seems so simple nowadays that its significance and profound importance 
are no longer appreciated" and, further, "The idea of expressing all quantities by 
nine figures whereby is imparted to them both a position and an absolute value is 
so simple that this very simplicity is the reason for our not being sufficiently aware 
how much admiration it deserves".⁸ An even more significant tribute to its power, 
not so much in the eloquence of his words as in the remarkable use he put it to, came 


8 Post-revolutionary France officially adopted (in 1799) decimally ordered (metric) units for length 
and mass mainly as a result of the advocacy of Laplace (along with Lagrange). The sexagesimal division 
of time was obviously too well entrenched to be meddled with. The opposite happened in India much 
earlier: the adoption of the new sexagesimal time divisions in the 5th century CE barely touched the 
decimally organised counting numbers dating back to the Rgveda or the possibly equally ancient but 
less systematic multiples and subdivisions of lengths and weights. 
a century earlier from Newton. Understanding that use, in calculations with power 
series, needs the context to be prepared and I will come to that in a while. 

What the decimal system first does in practice is to take a notion of a (countable) 
quantity, which humans apparently have an innate faculty for when the numbers 
are small but becomes uncertain and hard to manage as they increase, and give 
it the same capacity for precision no matter how large the numbers are. That this 
problem was a preoccupation is clear from a number of early vedic passages. Some 
of them refer to situations in which numbers are matched or paired off without 
their magnitudes being specified, implying an understanding of when two numbers 
are equal even if each is not known ‘absolutely’, which is a plausible starting 
point in trying to quantify a discrete set. Then we have the verse from the Rgveda 
quoted by Staal in [12] and first highlighted by Renou, about the fire god Agni 
being the only being capable of apprehending precisely, we are not told how, large 
numbers. Over a period of time, but metaphorically at one stroke, the new way of 
counting gave people (and gods, not counting, naturally, Agni), the means to deal 
with arbitrarily large numbers with the same facility with which they dealt with 
small numbers. (That decimal counting did not completely supplant comparison 
counting at one stroke may be the message of the well-known passage from the 
Yajurveda in which a list of powers of 10 goes hand in hand with a pairing off 
of cows and sacrificial bricks; for the full passage, see for instance [13], p.131). 
One might say, from a modern viewpoint, that it replaces an abstract recursive 
definition — no one does any arithmetic starting from the Dedekind-Peano axioms — 
by a recursive constructive procedure of limitless power. Not the least significant 
aspect of this taming of numbers is that it goes hand in hand with their naming, 
by resorting to a relatively small collection of simple proper names for the atomic 
numbers⁹ 1 to 9 as well as for 10 and its powers, supplemented, ideally, by two 
clearly differentiated rules for combining these names, one for addition to and the 
other for multiplication of powers of 10. And to name is to bring into being, to 
make exist. The wonder of this achievement is still alive in the pages of YB: in the 
very first section, after quoting a verse from Bhaskara II on the names of powers 
of 10 up to parardham (10¹⁷), there occurs the statement, "If we endow numbers 
with multiplication and positional variation (sthanabhedam), there is no end to the 
names of numbers; hence we cannot know [all] the numbers themselves and their 
order" [Sarma 1.4]¹⁰. The rest of the opening chapter is then devoted to a concise 
but complete account of the basic arithmetical operations on decimally written 
numbers. 
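
The recursive character of the place-value construction can be made concrete in a few lines of code. This is a modern illustration and not anything found in YB: the function names `digits` and `value` are hypothetical, and the base is left as a parameter b so that the same rule covers base 10 and any other base.

```python
def digits(n, b=10):
    """Atomic numbers (0 .. b-1) of n in base b, most significant first.

    Recursive rule: the representation of n is the representation of
    n // b followed by the single atomic number n % b.
    """
    if n < b:
        return [n]
    return digits(n // b, b) + [n % b]

def value(ds, b=10):
    """Inverse of digits: rebuild the number with one multiplication by
    the base and one addition per place."""
    if len(ds) == 1:
        return ds[0]
    return value(ds[:-1], b) * b + ds[-1]

print(digits(1729))            # [1, 7, 2, 9]
print(value([1, 7, 2, 9]))     # 1729
print(digits(10 ** 6))         # a 1 followed by six 0s
```

The same two rules, applied without bound, reach every natural number, which is the sense in which a finite stock of names and operations suffices.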


9 More generally, in a place-value system with base b, I use this term for the numbers 0 to b − 1, 
mindful of the association of the term 'digit' with the base 10. 

10 My translation; Sarma's version deviates apparently slightly but, for our purpose, crucially. 
Throughout this article, all unattributed translations are mine. The Sanskrit-Malayalam words and 
phrases cited are spelt as they are in YB and in Malayalam generally. 
1.4. The Infinite and the Infinitely Small; Calculus 


Georges Ifrah's book [14] has a table giving the names of powers of 10 which 
goes up to 10?! , While it is difficult to be extremely precise about the antiquity 
of the very many lists of names of powers of 10 made by Hindus, Buddhists and 
Jainas alike (see for instance [9] or Hayashi's book on the Bakhshali manuscript 
[15]), all of them probably antedate the 3rd century CE or so¹¹ by which time 
sectarian Buddhist texts were being written in prolific quantity, many containing 
cosmogonic speculations or stories about the omniscience of the Buddha involving 
large, enormously large, numbers. (If anything, the Jainas were even more 
number-happy). For a while, it would appear that such semi-quantitative ideas of the very 
large coexisted with qualitative notions of the infinite as expressed in the oft-quoted 
upanisadic passage concerning the concept of purnam (full or complete) as the 
quality of being unchanged when something is added or taken away, or in the 
early Buddhist canonical references to the Buddha's agnosticism about the spatial 
and temporal boundedness or otherwise of the universe. The number lists are just a 
witness to the dawning realisation that there was now a bridge to infinity, constructed 
span by decimal span; a bridge without end, but one that would carry them as far as 
they wished to go. 
By the time of Bhaskara II (12th century CE), Indian mathematicians, most of whom 
were also astronomers, no longer felt the need to glory in their mastery 
of astronomical numbers. The list of numbers in his Lilavati stops at parardha 
(which is now 10^17) and for YB, 10^17 is a perfectly acceptable point at which to 
halt in the interminable journey and do the rest of it in the imagination: YB leaves 
no doubt that it understands the resort to a fixed large number, as a substitute 
for an infinitely large one, to be a matter of practical convenience and ease of 
explanation.¹² 

At the other extreme, the infinitely small caused no difficulty at all, once the approach 
to infinity itself was brought under control: as a number was taken to larger 

¹¹ [...] Hindu texts that are very much earlier; the Yajurveda has a [...] lists name all the consecutive powers of 10, stood at that time [...] antiquity and provenance [14, 15] and they do not all [...] 

¹² [...] the calculus part of YB is the infinite series for π. The derivation of [...] known now, by dividing half the side of the circumscribing square of the [...] n tend to infinity (see further down in this subsection). [...] which says in part, “The smaller the [...] side is cut into atoms and [...] 

and larger values, its reciprocal came closer and closer to zero. In a general sense, 
division (at least by a non-zero number) and the concomitant fractional numbers seem 
to have posed no conceptual problems in India from an early time, just as subtraction 
and negative numbers did not. Even in YB, the operation of division is dismissed in 
a very brief paragraph after an explanation in detail of several different ways of car- 
rying out multiplication. Similarly, the chapter on fractions is only concerned with 
the mundane computational aspects of extending the basic operations on integers to 
fractions — mostly about finding a common denominator. When these operations are 
subsequently applied to geometric quantities, we see the same pragmatism in play. 
A segment of arbitrarily small size of a straight line or an arc was arrived at simply by 
cutting it into a number n of equal pieces and then letting n increase without bound. 
We might say that the only limit that explicitly occurs is the limit n → ∞, never the 
Newtonian limit x → 0; no philosophical energy needed to be spent on defining or 
describing ab initio the notion of an infinitesimal. The contrast with the European 
story of calculus, where ‘local’ questions such as the problem of tangents (finding a 
sequence of chords to a curve that approached the tangent at a point) and of extrema 
(determining the points at which a function or its curve attained its local maxima 
and minima) were from the beginning in focus, is clear enough to need no further 
comment. 

The derivation of the series for π is a good illustration of this approach.¹³ 
The problem is that of finding the length of (or rectifying) an arc of the circle 
(of unit radius) equal to one-eighth of its circumference, i.e., subtending an angle 
θ = π/4 at the centre, and it is in its solution that the numerical series for π and 
its generalisation, the arctangent series, (described later in section 3) first saw the 
light of day. Schematically, the method consists in cutting up the unit tangent to 
the circle at any point (tan π/4 = 1) into n equal parts, approximating the 
arc-segment corresponding to each tangent-segment by its chord (more accurately, 
by half the chord of twice the arc, a little twist that may justly be considered the 
founding step of Aryabhata's trigonometry), adding them up and finally letting n 
grow large. Right at the start of the description, there is the remark, “The greater 
the number [i.e., n], the more accurate the circumference”. (YB treats the case of 
the general arctangent series later by means of an elementary property of similar 
triangles; here I do not distinguish between the general and the particular cases). 
So it is clear that the procedure was effectively understood to be what we would 
consider today an especially simple instance of the use of the fundamental theorem of 
calculus: to determine a function explicitly, first differentiate it and then integrate 
the derivative. But the procedure did not, as it did not later for Gregory and Leibniz, 


¹³ Most of the rest of this section is, unavoidably, about the actual mathematical content of YB. 
Though I have kept equations at bay, it will be useful to have in hand an account of the mathematics 
as given for instance in the supplement of explanatory notes [17] by Ramasubramanian, Srinivas 
and Sriram (abbreviated from now on to [RSS]) in [Sarma]; better still, come back here after reading the rest 
of the present article. 

give a closed expression for the circumference. A variety of recursive tools are now 
put to work. First, each term in the sum over the segments, before the large n limit 
is taken, is itself turned into an infinite series by means of an elementary recursive 
identity, a result which continues to hold in the limit. (The recursive identity will 
get its fair share of attention in the following sections). Though there is nothing 
said about the possibility of the series not converging at this point (it is dealt with 
subsequently), the student/reader is left in no doubt that the sum of the whole series 
is the exact length of the arc which one may then approximate by truncation at 
any point. In this step, the debt that the free use of the (actually) infinite series 
owes to the (potentially) infinite series of powers of 10 which are the decimal 
representations of integers appears direct and undeniable. Over a century after the 
writing of YB, the story repeated itself in Europe in a different guise when Nicolaus 
Mercator and Isaac Newton first wrote down the logarithmic series and the binomial 
series. Newton felt the need to provide a rationale for his formal manipulation of 
infinite series from “the doctrine recently established [!] for decimal numbers” 
in a revealing passage from De Methodis Serierum et Fluxionum written in 1670-1671 
(translated by Whiteside [18], vol. III, p.32) and it merits being quoted in 
extenso: 


Since the operations of computing in numbers and with variables are closely similar 
(indeed there appears to be no difference between them except in the characters 
by which quantities are denoted, definitely in the one case, indefinitely so in the 
latter), I am amazed that it has occurred to no one (if you except N. Mercator 
with his quadrature of the hyperbola) to fit the doctrine recently established for 
decimal numbers in similar fashion to variables, especially since the way is then 
open to more striking consequences. For since this doctrine in species has the same 
relationship to Algebra that the doctrine in decimal numbers has to Arithmetic, its 
operations of Addition, Subtraction, Multiplication, Division and Root-extraction 
may easily be learnt from the latter's provided the readers to be skilled in each, [...] 


[...] Simon Stevin had already decimalised [...] 
[...] to infinity (as you will see done in the [...] 

For Newton this was no mere contingent justification. Six years later, in the earlier 
of the two letters to Oldenburg for forwarding to Leibniz, the Epistola Prior ([19], 
Vol. II, p.32), he says: 


Fractions are reduced to infinite series by division; and radical quantities by ex- 
traction of the roots, by carrying out these operations in the symbols just as they 
are commonly carried out in decimal numbers. 


What Newton had in mind clearly is that a decimally represented integer can 
be thought of as the value of a polynomial when the ‘variable’ is fixed at 10: if 
c_i is the entry in the ith place of a number n, then n is the value of Σ_i c_i x^i at 
x = 10; one has of course to keep in mind that the entries in a decimal number are 
restricted to the digits 0 to 9 and so the correspondence requires the carry over rules 
to be imposed on them when two numbers are added or multiplied. Further down 
(section 5.1), we shall see how just this correspondence is utilised by YB to model 
the algebra of polynomials in one unknown (an arbitrary positive odd integer) on 
decimal arithmetic. 
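
The correspondence between decimal numbers and polynomials can be made concrete with a small sketch. It is only an illustration in modern terms, not a rendering of anything in YB or in Newton; the function names (digits_to_number, poly_mul, carry) and the sample numbers are my own choices. Two numbers are multiplied as polynomials, entry by entry, and the carry-over rule is imposed only at the end.

```python
def digits_to_number(coeffs, x=10):
    """Evaluate sum_i c_i * x**i; coeffs[0] is the entry in the units place."""
    return sum(c * x**i for i, c in enumerate(coeffs))


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def carry(coeffs, base=10):
    """Impose the carry-over rule so that every entry lies between 0 and base - 1."""
    out, c = [], 0
    for v in coeffs:
        c, d = divmod(v + c, base)
        out.append(d)
    while c:
        c, d = divmod(c, base)
        out.append(d)
    return out


a = [7, 4, 3]                   # the number 347, units entry first
b = [9, 5]                      # the number 59
raw = poly_mul(a, b)            # product as a polynomial; entries may exceed 9
product_digits = carry(raw)     # the same number with entries restricted to 0..9
```

Before the carry step the entries of the product are not digits at all, yet the polynomial already takes the right value at x = 10; the carry rule changes the representation, not the number.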


1.5. A Recursive Proof 


The use of the recursive identity mentioned above is not an essential part of calculus 
per se. It is an auxiliary or enabling tool in much the same way that binomial series 
were for Newton. (Special cases of the infinite series generated by the identity in 
fact coincide with special cases of Newton's binomial series, one specific instance 
being the present case of the arctangent series). Its use leads to a representation of 
each term in the series as a finite sum which in turn will become an integral, the 
integral of a positive even power, in the limit of large n. These integrals are still to 
be evaluated and the next key step forward, that of a recursive proof, makes its first 
appearance here. 
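
A numerical sketch, in modern notation and under my own assumptions, may help fix ideas. I take the contribution of the kth piece of the unit tangent to be (1/n)/(1 + (k/n)^2), which is the present-day way of writing the arc element and not a transcription of YB; the names segment_sum and power_sum are hypothetical. The second function computes the finite sums of even powers that turn into integrals as n grows.

```python
import math


def segment_sum(n):
    """Sum over n equal pieces of the unit tangent: (1/n) * sum of 1/(1 + (k/n)^2)."""
    return sum(1.0 / (1.0 + (k / n) ** 2) for k in range(1, n + 1)) / n


def power_sum(p, n):
    """Finite sum (1/n) * sum of (k/n)^p, which approaches 1/(p + 1) for large n."""
    return sum((k / n) ** p for k in range(1, n + 1)) / n


n = 100_000
arc = segment_sum(n)                                    # close to pi/4
even_power_sums = {p: power_sum(p, n) for p in (0, 2, 4, 6)}

# Alternating combination of the even-power sums: the first few terms
# of 1 - 1/3 + 1/5 - 1/7 + ... as they emerge from the finite sums.
first_terms = sum((-1) ** (p // 2) * s for p, s in even_power_sums.items())
```

With n this large the sums of the powers 0, 2, 4, 6 are already very near 1, 1/3, 1/5, 1/7, which is the sense in which each term of the series is the limit of a finite sum.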

In a daring step announced with little fanfare, YB recognises that the way 
to evaluate the even integrals is i) to consider the integrals of all nonnegative 
powers, both even and odd, and ii) to relate the value of the integral of a given 
(kth) power to that of the (k − 1)th power, i.e., an even (odd) term is obtained 
from the preceding odd (even) term. This is an astonishingly original thing to 
have done. Nothing that went before in Indian mathematics prepares us for this 
step, which should not be surprising since what the step encodes is the principle 
of integrating by parts or, equivalently, the rule for differentiating products. It 


¹⁶ YB has a very matter-of-fact and pedantic style; none of the light-hearted asides favoured by 
other writers, such as addressing questions and explanations to the slow-witted or the adept (Bhaskara 
II poses one problem to a “proud expert in algebra”, bije patutabhimanah), can be found in its 
pages. 




298 P. P. Divakaran 


is also a conscious application of the principle of induction, in itself a sophisticated 
(if not the most sophisticated) manifestation of the power of recursive methods. What 
makes it work is that the recursive step is generic: the procedure that generates 
the kth term from the (k − 1)th term does not depend on k. The first step, that of 
considering all k, no matter how large, is thus forced by the desire for an exact 
answer and the resulting recursive nature of the proof. Or, perhaps, it is the other 
way: the impulse is the recursive mindset and it is that which makes an exact answer 
attainable. 
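
The generic character of the recursive step can be checked on the finite sums themselves. The sketch below is an assumption-laden illustration: the relation it tests, n·S(k−1, n) − S(k, n) = S(k−1, 1) + ... + S(k−1, n−1), is the form in which the step is usually written in modern accounts, and the symbol S and the function names are mine, not YB's. The same relation holds for every k, and it is what drives the leading behaviour n^(k+1)/(k+1) from one power to the next.

```python
from fractions import Fraction


def S(k, n):
    """Sum of the kth powers of the integers 1..n."""
    return sum(j ** k for j in range(1, n + 1))


def step_holds(k, n):
    """Check n*S(k-1, n) - S(k, n) == S(k-1, 1) + ... + S(k-1, n-1)."""
    return n * S(k - 1, n) - S(k, n) == sum(S(k - 1, m) for m in range(1, n))


# The step has the same form whatever the power k.
checks = [step_holds(k, n) for k in range(1, 5) for n in (1, 5, 50)]

# Leading behaviour for large n: S(k, n) / n**(k + 1) approaches 1/(k + 1).
n = 2000
ratios = {k: Fraction(S(k, n), n ** (k + 1)) for k in range(0, 5)}
```

Exact integer and rational arithmetic is used so that the check of the relation is an equality and not a floating-point comparison.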

All of this is covered in chapter 6 of YB, which then concludes by treating a 
practical problem raised by (convergent) infinite recursion: how can one extract 
useful numbers from a procedure that is endless? If the successive terms in an 
infinite series diminish in value rapidly, one can, optimistically, be satisfied with 
truncating the series after a finite (small) number of terms, the number depending 
on the rate of convergence and on the accuracy demanded. The arctangent series 
converges notoriously slowly. YB has some quite remarkable ways of accelerating 
the convergence by a process of reordering the terms, achieved by the use of a 
method of recursive correction. It also poses the question of how one may estimate 
the remainder left out by a truncation at the ith term, i.e., the difference between 
the exact value (the sum of all the infinite number of terms) and the sum up to i 
terms. The impressive way this is done, no longer to our surprise by now, is again 
recursive in content but with an interesting modification. A first guess is made about 
its value, which is then fed into the equations satisfied by the remainder to generate 
a ‘truncation error’ whose value (sthaulyam, grossness) is used as a guide in fine- 
tuning the initial guess rather than as the input in the next iterative step. The reason 
is that this would-be recursion within a recursion becomes impossibly complicated 
to implement after the first stage. The step that is iterated is the one of going back 
to the first guess and changing it judiciously. Already in two or three such iterations 
very precise values for π are produced. 
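
The effect of successively refined guesses for the remainder is easy to see numerically. The three correction terms used below, 1/(4n), n/(4n^2 + 1) and (n^2 + 1)/(4n^3 + 5n), are the ones usually quoted in the literature on the Kerala school; they are not stated in the passage above, and the sketch does not reproduce the reasoning by which they were reached. Names such as partial_sum and corrected_pi are hypothetical.

```python
import math


def partial_sum(n):
    """First n terms of 1 - 1/3 + 1/5 - ..., the series for pi/4."""
    return sum((-1) ** k / (2 * k + 1) for k in range(n))


# Successively finer guesses for the size of the remainder after n terms.
corrections = [
    lambda n: 0.0,                                  # plain truncation
    lambda n: 1 / (4 * n),
    lambda n: n / (4 * n * n + 1),
    lambda n: (n * n + 1) / (4 * n ** 3 + 5 * n),
]


def corrected_pi(n, guess):
    """Four times (partial sum plus the signed remainder estimate)."""
    return 4 * (partial_sum(n) + (-1) ** n * guess(n))


n = 10
errors = [abs(corrected_pi(n, g) - math.pi) for g in corrections]
```

With only ten terms of the series, each refinement of the guess reduces the error by several orders of magnitude, which is why two or three stages suffice in practice.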


1.6. Solving a Differential Equation 


The bulk of chapter 7 of YB is devoted to a problem that goes back to Aryabhata, 
that of expressing the sine of an arbitrary angle in terms of that angle or, in geometric 
language, given an arc, to find an expression for half the chord of twice the 
arc. In a famous (semantically nonsensical) verse (Gitikapada 10 of Aryabhatiya, 
abbreviated to AB), Aryabhata sets out the results of his computation in the form 
of a table of sines for 24 equally separated angles in the first quadrant, i.e., in 24 
steps of π/48. He himself in an opaquely worded stanza (Ganitapada 12 of AB) and 
his immediate successors in more transparent words, Suryasiddhanta for example, 
have indicated the procedure used to make the table. With help from later works, in 
particular YB, the principle behind the construction can be described qualitatively 
as follows. 
When the angle is close to zero, the value of its sine is close to the value of the angle 
if angles are measured, as is commonly done today, in natural units, namely radians. 
Equivalently, both the angle and the sine can be expressed in sexagesimal minutes as 
Aryabhata did. (This was a nontrivial matter; Ptolemy’s astronomical calculations 
were made exceedingly involved by his partial use of degrees and minutes). To the 
accuracy that Aryabhata worked to, which was a minute of arc, π/48 was small 
enough for this to be true as could be verified by easy computation, for example 
starting with π/6 and halving it three times. Thus the first two entries in 
the sine table were filled. The next step required a notion of sine differences, namely 
the difference between the sine of a given angle in the table and its predecessor (this 
was what Aryabhata actually tabulated in his verse), and then, of their second 
differences, namely the difference between two successive such sine differences. 
Elementary circle-geometry then leads to an equation for the second difference. 
The recipe outlined in Suryasiddhanta is most simply interpreted as arising from 
an approximate recursive solution of this difference equation, terminated after the 
first iteration, which is accurate to the required degree (or, one should rather say, to 
the minute), an interpretation well supported by the relevant portion of YB (section 
7.4.2 of [Sarma]).¹⁷ I am conscious of the inadequacy of this summary of one of the 
cleverest parts of early Indian mathematics but, once again, invite the reader to go 
to the technically complete account in section 4.2. 
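
For orientation, here is a sketch of how a sine table grows out of its first two entries once the second difference is known to be proportional to the sine itself. It uses the exact difference equation in modern form, with coefficient 4 sin^2(h/2), and is therefore not the approximate recipe of the texts; the radius of 3438 minutes is the value commonly associated with Aryabhata's table and is my assumption here, as are all the names in the code.

```python
import math

R = 3438                        # radius in minutes of arc (assumed, see lead-in)
h = math.pi / 48                # step angle: 24 steps to the quadrant
c = 4 * math.sin(h / 2) ** 2    # exact second-difference coefficient


def sine_table(first_entry, coeff, steps=24):
    """Build the table from the entries for 0 and h by second differences."""
    table = [0.0, first_entry]
    for _ in range(steps - 1):
        prev, cur = table[-2], table[-1]
        # (next - cur) - (cur - prev) = -coeff * cur
        table.append(2 * cur - prev - coeff * cur)
    return table


true_values = [R * math.sin(k * h) for k in range(25)]
exact_start = sine_table(R * math.sin(h), c)    # first entry taken as the true sine
small_angle_start = sine_table(R * h, c)        # first entry taken equal to the arc

worst_exact = max(abs(a - b) for a, b in zip(exact_start, true_values))
worst_small_angle = max(abs(a - b) for a, b in zip(small_angle_start, true_values))
```

Started from the true first sine, the recursion reproduces the whole table to rounding error. Started from the arc itself, the only error is the one made in equating the first sine with its angle, carried proportionally through the table.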

The journey from Aryabhata’s small angles and sine differences to the infinitely 
small angles and sine differentials of YB took close to a thousand years and also 
took some false turns.¹⁸ But the recognition that the equality of an angle and its sine 


¹⁷ A clear and early (1795) exposition in English of this material is that of the Scottish geometer and 
historian of geometry, John Playfair [20]. (He of the parallel axiom but also, more pertinently for us, 
one who closely followed the work of H. T. Colebrooke on Indian science. Charles Whish, the first to 
write in English on some of the Kerala texts including YB, was also strongly influenced by Colebrooke.) 
Only in one respect is Playfair slightly off the mark: he conjectures that “the brahmins” had in hand the 
geometrical equivalent (Ptolemy's theorem etc.) of the addition formulae for the sine and the cosine. 
That is unlikely; what they had was what they needed, the formulae for small increments (see section 
4.2). I am grateful to Roddam Narasimha for the Playfair reference and for making me appreciate its 
historical importance in the reconstruction of Aryabhata’s sine table. 

¹⁸ Ganitapada 11 of AB is a strong indication that Aryabhata himself understood perfectly well that 
his procedure was an approximate one and that it could be made more accurate if desired by a finer 
division of the circumference. But, until we get to Kerala, his followers seem to have led themselves 
astray. In his bhasya of AB, Bhaskara I asks the intriguing question: does a circle have an arc equal to its 
own chord? and answers: yes, otherwise an iron ball resting on the floor will not be stable; such an arc will 
be (1/100)th (a fraction smaller than (1/96)th!) part of a circle (see [21] for instance). More interestingly, 
Bhaskara II arrived at the correct formulae for the surface area and volume of a sphere by dividing a 
great circle into 96 equal parts and effectively equating 2π/96 = π/48 to its sine (see Sarasvati Amma's 
book [6] for the details). We have to conclude, on available evidence, that little conceptual progress 
was made in the time intervening between Aryabhata's cryptically expressed realisation of the need for 
indefinitely continued subdivision (Ganitapada 11) and Madhavan's full-fledged calculus of the circle. 
I hope to return to the question of the prehistory of Indian calculus elsewhere. 

is only a limiting property of angles tending to zero, presumably by Madhavan, and 
its being put to use to transform Aryabhata’s approximate table into Madhavan’s 
exact series seem to have been almost simultaneous. There are two parts to this 
transformation. First, Aryabhata's equation for the second differences was shown 
to become exact as the step size was taken towards zero becoming, automatically as 
it were, the second order differential equation satisfied by the sine (and the cosine) 
function — this is calculus pure and simple, exactly as we might proceed today. But 
the recursive solution of the equation now has to be carried through all the way, 
through an infinite number of iterations, to get an exact expression for the sine; 
that is precisely what YB does, resulting in the famous series in powers of the 
angle. 
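
In present-day language, carrying the recursive solution through all its iterations can be sketched as follows. This is a modern restatement under my own assumptions, not YB's procedure: the differential equation for the sine is rewritten as "sine = angle minus the twice-integrated sine", the first guess is the angle itself, and each pass feeds the previous output back in. Polynomials are stored as lists of exact fractions; the names integrate and refine are hypothetical.

```python
from fractions import Fraction


def integrate(poly):
    """Integrate a polynomial (coefficients lowest power first) from 0 to theta."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(poly)]


def refine(poly):
    """One iteration: s -> theta - (s integrated twice)."""
    twice = integrate(integrate(poly))
    out = [-c for c in twice]
    out[1] += 1                      # put back the leading term theta
    return out


s = [Fraction(0), Fraction(1)]       # first guess: the sine equals the angle
history = [s]
for _ in range(3):
    s = refine(s)
    history.append(s)
```

Each iteration leaves the earlier coefficients untouched and adds exactly one new odd power, with coefficients −1/6, 1/120, −1/5040 in turn, so that the infinitely continued process yields the series in powers of the angle term by term.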

There are several extremely ingenious steps involved in this progress from 
Aryabhata (finite differences and approximate numerical solutions) to Madhavan 
(differentials and exact analytic solutions), some of them wonderfully illustrative of 
the flexibility of recursive techniques — most remarkable is an instance of a doubly 
inductive proof which will get its deservedly full treatment later in section 4.4. 
But none of that should blind us to the revolutionary new way of doing mathematics 
that made it possible in the first place, the way of calculus that Madhavan 
pioneered. 


1.7. Recursion and Rules of Composition 


The following sections aim to provide the mathematical details of some of the more 
striking uses to which recursive methods are put in YB (as well as some historical 
background where relevant and available), after first taking a closer look at that 
model recursive object, the natural numbers as expressed in the place-value nota- 
tion. The approach will be almost entirely descriptive. There will be nothing here 
relating to the elaborate theoretical framework(s) constructed around the general 
idea of recursion in linguistics, logic, foundations of mathematics, computational 
theory and so on, the main reason for the reticence being lack of knowledge and 
competence. But it is not an unreasonable stance to adopt when dealing with In- 
dian mathematical thought which famously (or notoriously, depending on one's 
epistemological leanings) declined to take categorical positions in regard to issues 
of definitions, axioms, rules of deduction, etc., etc. It is not that Indian men of 
learning were indifferent to the reliability or otherwise of their ways of establishing 
what is true — we have only to read Nilakanthan to know that methods of acquiring 
and validating knowledge, both empirical and theoretical, were a constant concern 
among, at least, astronomers and mathematicians. Nothing was self-evident to these 
inheritors of a tradition, going back two millennia or longer to the time of the later 
upanisads and the sramana movement, of questioning and sceptical enquiry. Their 
position appears to have been that an excessive preoccupation with first principles, if 
there be such, was a distraction from the more important business of understanding 
the world as it is.¹⁹ They might have been intrigued but not surprised if they could 
have known that present-day mainstream scientific practice, after having explored 
the axiomatic methodology to its limits, has gradually been turning towards a more 
easy and pragmatic view of such foundational questions; that these are issues the 
practitioners leave to others to worry about or, at worst, think about on Friday 
afternoons. 

Having said this, this very spirit of realism should persuade us to try and evaluate 
the place of recursive methods, in the very broad sense in which we take them, as a 
productive means of creating interesting new mathematical knowledge. In his work 
on the structure of rituals (and, inevitably, its relation to the structure of language), 
Frits Staal ([7], p.42) characterises a recursive rule as one which “can be applied 
to its own output” and so “can produce infinitely many forms”. Intuitively, this is 
an appealing definition, general enough to embrace the instances that we are going 
to be concerned with. But when dealing with a developed mathematical science 
from our current perspective, it will need to be both sharpened and broadened; the 
emphasis on infinity, for instance, will have to be relaxed. It can well (and does) 
happen that, for any chosen initial input, the output after a certain finite number 
of steps coincides with the initial input or the input/output of a preceding step. An 
exact recursive solution of the exact difference equation for the sine will be a case 
in point: if the incremental angle is 2π/N for example as it was for Aryabhata, 
the output of the Nth step will be the same as the initial input because of the 
periodicity of the sine function. More particularly, such cyclic recursions will occur 
whenever the mathematical structure we are computing in is defined on a finite set 
(but finiteness is not a necessary condition). It would be pointless to go through the 
same cycle over and over again, learning nothing new, but it is not a serious matter 
in itself (except, perhaps, for a badly programmed computer which has not been 
told when to stop); the interest that a recursively generated result may hold is not 
necessarily in proportion to the number of steps needed to get to that result. 
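
Both kinds of cyclic recursion mentioned here can be exhibited in a few lines. The sketch is illustrative only; the choice N = 96 corresponds to the step π/48, while the modulus 7 and the multiplier 2 in the second example, like the function names, are arbitrary choices of mine.

```python
import math


def iterate_sine(N, steps):
    """Iterate the exact second-difference recursion with step angle 2*pi/N."""
    h = 2 * math.pi / N
    c = 4 * math.sin(h / 2) ** 2
    prev, cur = 0.0, math.sin(h)
    values = [prev, cur]
    for _ in range(steps - 1):
        prev, cur = cur, 2 * cur - prev - c * cur
        values.append(cur)
    return values                     # entries for 0, h, 2h, ..., steps*h


N = 96
v = iterate_sine(N, N + 1)            # one full turn and one step more


def orbit(start, multiplier, modulus):
    """Apply x -> multiplier * x (mod modulus) until a value repeats."""
    seen, x = [], start
    while x not in seen:
        seen.append(x)
        x = (x * multiplier) % modulus
    return seen


cycle = orbit(1, 2, 7)                # a recursion on a finite set must cycle
```

After N steps the sine recursion is back at its starting pair of values, and repeated doubling modulo 7 returns to 1 after only three steps; in neither case does going round again teach anything new.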

A more pertinent, broader, issue is that the manner of our response to modes of 
thinking from the past is necessarily subservient to a certain tyranny of progress. 
No historian can fully unlearn what he has learned of subsequent events, however 
conscious he may be of the imperative of placing himself in the epoch. In the case of 
recursive mathematical techniques, this psychological barrier is a particularly diffi- 
cult one to penetrate. The reason is simply that an essential part of the progress that 
tyrannises has consisted in reorienting mathematics towards an abstract structural 
mode of thinking even when the particular problem at hand may arise in a concrete 
context. Such structures are defined completely by rules governing the operations 


¹⁹ In some of his more philosophical writings, Nilakanthan gives us a clear idea of the epistemological 
moorings of his work and, presumably, that of others of the Kerala school; especially revealing are his 
remarks on the futility of seeking logical first causes. These writings have recently been critically 
examined by Roddam Narasimha in a series of articles which can be retraced from [3]. 

that can be performed on the elements of a given (structured) set, in particular rules 
of composition associating to every pair of elements precisely one element of the 
same set. Since the rule of composition is defined on every pair of the set, it follows 
that it can be applied to its own outputs over and over again. More generally, a 
problem of interest may involve a collection of such structurally defined sets with 
rules governing ‘maps’ from one set of the collection into another (including composition 
within one such set), which can themselves be successively applied subject 
to various conditions of compatibility. In short, recursive operations are embedded 
in the very genes of modern mathematics, their deployment routine and automatic; 
progress has turned the creative into the commonplace. But they do still catch the 
eye when a description or a proof depends critically and explicitly on the recursive 
properties of, say, composition rules. Instances are easy to come by. In the area of 
recursive proofs in particular, the very general technique of proof going under the 
name of mathematical induction is without doubt the prime example. Here let us 
only recall that inductive proofs proceed by establishing a correspondence between 
an infinite number of propositions and the recursively defined natural numbers. 

The abstract structural vision that animates much of modern mathematics was 
of course not even a gleam in the eye of Madhavan and the school he founded. 
Nevertheless, in assessing the special role that recursive reasoning played in their work, 
we have little choice but to use the same criterion that we would in a contemporary 
situation: how good is the mathematics they produced and could they have reached 
their mathematical destination by a path that was not the one they so consciously 
built, one step after another? Were there other more banal, ‘natural’, means available 
to them which would have done the job? The quality of the mathematics is by now 
[...] As for the method, I hope a reader who persists to the end will be 
convinced that the techniques they perfected and used so imaginatively and to such 
[...] are a natural continuation of the entire tradition of Indian mathematics [...] 

[...] own minds or to be communicated to others; no balance, meter stick or clock is 
required in the process of counting. 

But this is a hard-won autonomy it would seem. By all accounts, early humans 
had no problem with counting small numbers: they ‘took it in at a glance’. Staal 
[12] quotes Louis Renou to the effect that in the Rgveda the verbal root khya, from 
which comes samkhya meaning ‘number’, signifies ‘to look’ and that it is only in 
later vedic times that it acquired a numerical connotation. When it came to larger 
numbers, the early vedic people apparently (going by the cows-for-bricks passage) 
had to resort to comparison and correspondence, not with a fixed standard number, 
but with numbers chosen by the situation, a sort of ‘weighing’ of pairs of numbers 
in an abstract number-balance as it were.²⁰ That this was so is entirely in the nature 
of things; comparison is still the only way we have of quantifying magnitudes such 
as length or time which do not come in multiples of an irreducible, universal unit. 
If the Planck length had a human scale, the physical world would be discrete in the 
large and all measurements would consist of counting; vedic geometry certainly, 
and perhaps even Greek geometry, would not have had to be invented. 

How large had a number to be for people to fail to ‘see’ it? From the historical 
evidence we have, we might guess, not beyond 10—even the Babylonian sexagesimal 
system had a single separate symbol for 10. 

We can take the passage from the Rgveda about Agni, “He has seen or counted 
(people) like one observes groups of cattle with an owner of cattle” (Staal's translation 
[12] from Renou's French) in two senses: that Agni could see ‘at a glance’ or 
count the number of people as he could the number of cattle, or that he could count 
while others merely saw. In either sense, Agni could count, presumably beyond ten. 
One must assume that, for those who could not, the capacity to match went with the 
capacity to distinguish between ‘more’ and ‘less’ (to draw a meaningful conclusion 
from observing which pan of the number-balance went down) and by how much, 
as long as the difference was, again, small. That would introduce a semiquantitative 
(because of the limitation to small numbers) notion of order among numbers but 
would still not bring a mortal the divine gift that Agni possessed. The way forward 
from here is the obvious recursive one of extending the ordering by [...] 
comparisons. It is difficult at this point not to jump ahead and [...] 
subsequently extended by recursion, that is captured by [...] 
generality in the succession axiom. But although the [...] 
complete characterisation of natural [...] they still [...] 

²⁰ It is of course acknowledged that [...] 

to take in large numbers ‘intuitively’ or, in practical terms, to do arithmetic; to 
implement this last step we are once again obliged to go back to the primitive idea 
of succession and build everything up from the operation of adding 1 at a time (see 
section 2.3 below). 

“There is no end to the names of numbers; hence we cannot know the numbers 
themselves and their order”. To know numbers and to use them, we have first to give 
them names, an identity that will assign each of them its exact place in the order of 
all numbers and that will let us know instantly what that exact place is. For it to serve 
its purpose the nomenclature has to be rule-based — in order to let mere mortals see 
where a number denoted by a name stands in the order — and economical — so as not 
to tax their finite powers of memory. We might well paraphrase Patañjali and say, 
“How, then, must the meanings of numbers be understood? Some work containing 
general and particular rules must be composed”. We might go further and formally 
prescribe the “general” rules as the rules designating a number N by the unique 
ordered sequence of atomic numbers (including 0) n_0, n_1, ... which constitute its 


place-value representation, namely the coefficients of the powers of the chosen base 
b, not necessarily 10, in the sum 


N = n_0 × (b^0 = 1) + n_1 × (b^1 = b) + n_2 × (b^2 = b × b) + ..., 


expressed orally or in writing or both. In case the expression is verbal they need 
to include also the rules by which the number is given a verbal identity, which is 
accomplished by a set of names for atomic numerals juxtaposed in the correct order 
by use of joining or compounding rules applied at both the semantic (samdsa) and 
the syllabic (sandhi) levels. (If the representation is written down in symbols as 
in Mesopotamia, names are strictly speaking not necessary, but those who wrote 
and read them probably talked to one another). Thus made tangible, numbers are 
the first, the most primitive, concept in Indian, perhaps all, mathematics.”! “That 
which is the particular study of numbers relating to enumerables (samkhyeyam) is 
mathematics" (YB, introducing the place-value system and the rules of arithmetic). 
As for "particular" rules, the natural candidate is the collection of the laws of 
arithmetic worked out within the place-value representation, including the carry 
over or overflow rules. The particular rules cannot be given in a concise or even
practical way without the general rules having been established. How will one teach 


21There is good evidence that numbers have priority over geometry in the vedas not only conceptually
but also chronologically. The Sulbasutra manuals containing the earliest accounts of geometry in India
cannot be dated earlier than about 700 BCE while number names with a distinct decimal structure occur
freely already in the Rgveda. For all matters concerning the chronology of vedic times, I have relied on
the synoptic account of the current state of knowledge given in [13]. As for the earlier Harappan culture 
we can say little definite about their numbers (see footnote 23 and section 2.4 below), but a fair picture 
of its geometric sophistication is now beginning to emerge [22]. It is, however, not ruled out that there 
may be a degree of continuity between Harappan geometry and that described in the Sulba texts, 
or learn, for instance, the multiplication table in Roman numerals except, as with
Brhaspati and Indra, by endless enumeration? 
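
The place-value sum above can be made concrete with a short sketch. It is only an illustration in modern terms, not anything found in YB; the function names `to_place_value` and `from_place_value` are hypothetical, and the coefficients are listed lowest place first, in the order $n_0, n_1, \ldots$ used in the sum.

```python
def to_place_value(n, base=10):
    """Return the coefficients n_0, n_1, ... of n in the given base, lowest place first."""
    if n < 0 or base < 2:
        raise ValueError("expects n >= 0 and base >= 2")
    digits = []
    while True:
        # Peel off the coefficient of the current power of the base.
        n, coefficient = divmod(n, base)
        digits.append(coefficient)
        if n == 0:
            return digits


def from_place_value(digits, base=10):
    """Rebuild N = n_0 + n_1 * b + n_2 * b * b + ... from its coefficients."""
    total = 0
    for coefficient in reversed(digits):
        total = total * base + coefficient
    return total
```

For instance, `to_place_value(365)` gives the decimal coefficients of 365 and `to_place_value(365, 60)` its sexagesimal ones; in either base `from_place_value` recovers the number, which is the uniqueness the “general” rules rely on.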


2.2. The Power of 10 


One begins, then, by giving names to the atomic numbers 1 to 9 (in the decimal base 
case), eka, dvi, ..., nava and the base itself, 10, dasa. Ideally, 11, 12, ..., can now
be designated by an application of the joining rules: one-plus-ten (ekadasa), two- 
plus-ten (dvadasa) and so on till we get to 20 at which point, a different joining rule 
should come into play to create names for two-times-ten and similarly for 30 and so 
on. Combining these two joining rules, we have a system that, in principle, names 
all numbers up to 99 and it requires exactly ten distinct names and the convention of 
the two joining rules to be remembered. When it comes to 100, we have options. The 
first is to express it as the sum or product (or a combination of the two) of smaller 
numbers in very many different ways using the supposedly already acquired rules 
of arithmetic, an option that is available for numbers less than 100 as well but 
generally not used. Apart from the lack of uniqueness and the consequent absence 
of a repeatable structure in the naming rules, such an ad hoc system is bound to reach 
its limit sooner or later and new number names and more ad hoc rules will have to 
be introduced. The preferred Indian way was to give a new name to every power of 
10 so that, with minimal demand on resources, the naming method that works up 
to 99 can be indefinitely extended. It is also a fully recursive approach to naming 
and, so, to counting — one starts with a finite ordered set of names for the lowest 
nine of the ordered set of all numbers and two linguistic rules of composition (the 
joining rules); the repeated application of those rules together with progressively 
less frequent new names for the powers of 10 generates the decimal nomenclature 
of every number. 
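
The idealised naming scheme just described can be sketched as a small recursive-style procedure. This is an illustration of the ideal only, not of actual Sanskrit usage: the text names only eka, dvi, nava and dasa, the remaining atomic and power names below are the standard ones in plain transliteration and are supplied here as an assumption, and the `+` and `*` signs stand in for the two joining rules. The function name `ideal_name` is hypothetical.

```python
# Names for the atomic numbers 1..9 and for the first few powers of 10.
# Only eka, dvi, nava and dasa occur in the text; the rest are assumed.
ATOMS = ["eka", "dvi", "tri", "catur", "panca", "sat", "sapta", "asta", "nava"]
POWERS = ["dasa", "sata", "sahasra"]  # 10, 100, 1000


def ideal_name(n):
    """Compose an idealised name for n, lowest place first (one-plus-ten style)."""
    if not 1 <= n < 10 ** (len(POWERS) + 1):
        raise ValueError("n is outside the range covered by the power names")
    parts = []
    place = 0
    while n > 0:
        n, digit = divmod(n, 10)
        if digit:
            atom = ATOMS[digit - 1]
            if place == 0:
                parts.append(atom)                          # a bare atomic number
            elif digit == 1:
                parts.append(POWERS[place - 1])             # the power itself
            else:
                parts.append(atom + "*" + POWERS[place - 1])  # atom-times-power
        place += 1
    return "+".join(parts)                                  # the plus joining rule
```

With this sketch `ideal_name(11)` is `eka+dasa` and `ideal_name(20)` is `dvi*dasa`. Extending the range requires only one further entry in `POWERS` for each new power of 10, which is the economy the paragraph above points to.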

That is the ideal. In practice the ideal was never fully realised. It is not in 
the nature of natural languages to be completely logical and Sanskrit, vedic or 
later, is no exception. In the event, there have always been deviations from the 
ideal in the way numbers were designated, most markedly in the proliferation of 
synonyms for the same number, often arrived at by decomposing it as sums and 
products in different ways.²² At the same time, astronomers and mathematicians
from Aryabhata onwards were constantly on the lookout for a rule-based, symbolic 
but verbalisable representation that would not lead to very long names for big 
numbers and so not inconvenience the versifiers unduly. The goal was achieved 


22The book of Datta and Singh [9] has an extensive discussion of number names and number symbols 
and of possible reasons for some of these deviations. They suggest in particular that the use of different 
numerical expressions for the same number as well as that of bhutasamkhya, which is an association of
numbers with certain conventional multiplets (3 worlds, 8 directions, 9 planets (graha) and so on) has 
its origin in the exigencies of Sanskrit versification. 
finally with the invention and widespread use of the ingenious katapayadi notation,
especially popular in Kerala. Its basis is a systematic one-to-many map of the atomic
numerals into the consonant syllables that takes advantage of the organisation of
most of the Sanskrit (and Malayalam) consonants in groups of five, the latter then
being used in a place-value format, turning numbers into mnemonic ‘words’ which
could generally be contrived so as to give them a sensible meaning. Thus a given
number has many verbal representations, giving the system a welcome flexibility.
What matters more is that the inverse map from a ‘word’ to a number is well defined
and easy to figure out; in particular, the syllabic length is exactly the number of
‘places’ (examples: ma-tu-lah = 365, dhi-ja-ga-nnū-pu-ram = 210389, both from
YB). It is also properly recursive and comes closest to the abstract ideal, namely
a purely symbolic representation, without need or room in it for names for any
number other than the atomic ones (including for powers of the base), even for
oral communication; one might say that with the katapayadi, Indian mathematics
finally settled its dues to the Sanskrit syllabary. It is then something of a surprise to
find it used so very sparingly in YB, written at a time and place where it was all the
rage. Perhaps this was because YB was free from the metrical discipline of verse.
What it prefers to use instead are the common Malayalam (and Tamil) number
names, more systematic than Sanskrit-based ones and still in currency.²³
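
The inverse map from a ‘word’ to a number is simple enough to sketch in a few lines. The sketch below rests on several assumptions that are not spelt out in the text: the consonant values are the commonly quoted ones, written in IAST transliteration; in a plain ASCII rendering the letters t, d and n are read as dentals; a syllable beginning with a vowel counts as zero; and the syllables are supplied already separated. The names `DIGIT`, `VOWELS` and `decode` are hypothetical.

```python
# Commonly quoted katapayadi values, keyed by consonant (IAST transliteration).
DIGIT = {
    "k": 1, "kh": 2, "g": 3, "gh": 4, "ṅ": 5,
    "c": 6, "ch": 7, "j": 8, "jh": 9, "ñ": 0,
    "ṭ": 1, "ṭh": 2, "ḍ": 3, "ḍh": 4, "ṇ": 5,
    "t": 6, "th": 7, "d": 8, "dh": 9, "n": 0,
    "p": 1, "ph": 2, "b": 3, "bh": 4, "m": 5,
    "y": 1, "r": 2, "l": 3, "v": 4,
    "ś": 5, "ṣ": 6, "s": 7, "h": 8,
}
VOWELS = set("aāiīuūeo")


def decode(syllables):
    """Map a list of syllables to a number, one decimal place per syllable."""
    digits = []
    for syllable in syllables:
        # Only the consonants before the first vowel matter; the vowel itself
        # and anything after it (final m, visarga) carry no numerical value.
        cut = next((i for i, ch in enumerate(syllable) if ch in VOWELS), len(syllable))
        cluster = syllable[:cut]
        value = 0  # assumed convention: a bare vowel stands for zero
        i = 0
        while i < len(cluster):
            pair = cluster[i:i + 2]
            if len(pair) == 2 and pair in DIGIT:   # aspirates such as "dh"
                value, i = DIGIT[pair], i + 2
            else:
                value, i = DIGIT[cluster[i]], i + 1
        digits.append(value)  # the last consonant of the cluster wins
    # The first syllable is the units place, so reverse before reading off.
    return int("".join(str(d) for d in reversed(digits)))
```

Applied to the two examples quoted from YB, `decode(["ma", "tu", "lah"])` and `decode(["dhi", "ja", "ga", "nnu", "pu", "ram"])` return 365 and 210389. The forward map, from number to ‘word’, is one-to-many and is deliberately not attempted here.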
It is true that many of the points made above are not directly relevant to a strictly
symbolic written representation such as the Babylonian sexagesimal system. In
order to read their numerical tablets, we do not need to know what names the
Babylonians gave to their atomic numerals 1 to 59, leave alone what joining rules
they had. We do not even need to know whether they had names for their numbers at
all. It is nevertheless an amusing exercise to compare bases: what relative advantage
or drawback can be discerned in the choice of a particular base, 10 or 60? Whether
written or spoken, there is no escaping the requirement that each atomic numeral
has its individual name or symbol which will have to be memorised. The


( l up t ‘and then a new Silo! for 10, similarly Suspen for 

may take as having been devised mainly to overcome the 
‘names ey must have had use a similar strategy?) Did the 
In the present state of historical knowledge, the question has no
satisfactory answer. It is also probably moot; the very emphasis on naming points to
a more likely source of inspiration, the proximate one of the phonetic organisation 
in the vedic period of the Sanskrit syllabary. It may in fact very well be that both the 
number names and the syllabary have a parallel early vedic origin and evolution. 

In the Indian oral system, one had to have names also for the powers of 10; how 
many were needed depended on how high one wanted to count. It is completely 
elementary to see that, to count up to pararddham ($10^{17}$), one needs 27 (10 for the
atomic numbers and 17 for the powers of 10) distinct decimal names, to be combined
by the joining rules, and 69 ($60 + 9$, $60^9 < 10^{17} < 60^{10}$) sexagesimal names; to
count up to $10^7$, the corresponding numbers are 17 and 64 respectively. The decimal
system wins simply because of its far smaller number of atomic numbers. This may 
appear a frivolous reason to choose 10 over 60, but in a dominantly oral culture, 
it is not to be dismissed as irrelevant. And when it comes to doing arithmetic, the 
advantages are more concrete. For instance, once the place-value multiplication 
procedure is put in place, a major achievement in itself, one has only to learn ‘by
heart’ the multiplication table up to 9 × 9, something we all did in primary school,
and then just go by the rules. The basic sexagesimal 59 × 59 multiplication table
cannot ordinarily be memorised by mortals. As is well known, the Babylonians 
overcame this obstacle by using formulae like 


$$m \times n = \frac{(m+n)^2 - (m-n)^2}{4}$$


and preparing written master-tables of squares. 
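
A minimal sketch of multiplication by a master-table of squares follows. It illustrates the identity above and nothing more; the table layout and the names `SQUARES` and `multiply_by_squares` are assumptions of the sketch, not a reconstruction of any Babylonian tablet.

```python
# A 'master-table' of squares covering every possible sum m + n of two
# sexagesimal atomic numerals (0..59), i.e. the squares of 0..118.
SQUARES = [k * k for k in range(2 * 59 + 1)]


def multiply_by_squares(m, n):
    """Multiply two atomic numerals using only table look-ups, a subtraction and a quartering."""
    if not (0 <= m <= 59 and 0 <= n <= 59):
        raise ValueError("both factors must lie between 0 and 59")
    # (m + n)^2 - (m - n)^2 = 4mn, so the division by 4 is always exact.
    return (SQUARES[m + n] - SQUARES[abs(m - n)]) // 4
```

A table of 119 squares thus stands in for the 59 × 59 products that would otherwise have to be memorised; `multiply_by_squares(37, 59)`, for example, agrees with the ordinary product.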


2.3. Arithmetic 


The fundamental arithmetical operation in YB is the addition of 1 together with its 
inverse, the subtraction of 1. All the eight operations it explicitly describes, general 
addition and subtraction, multiplication and division, taking squares and cubes, and 
taking square roots and cube roots, are built out of these two basic ones by, it is 
needless to say, their repeated application: “Addition is of use in multiplication, 
multiplication in squaring, squaring in cubing. Similarly, subtraction is of use in 
division, division in [taking] square roots, square roots in [taking] cube roots. Thus, 
the preceding [operations] will be of use in the succeeding” ([Sarma] 1.3). (The eight
operations are first classified as those which increase and those which diminish.)
Addition of two numbers m and n is, as expected, explained as the n-fold repeated
addition of 1 to m (in more modern words, repeated applications of the successor
function) and similarly for subtraction. There is no explicit reference to negative
numbers in this (first) chapter; what to do with n − m for n < m does not get a
mention.
Multiplication of m and n is then defined, also as expected, as the addition of m
to itself n times. (“Then multiplication — it is just addition if we reflect [on it]"). The 
only slightly surprising point arises in relating division to subtraction, accomplished 
by falling back on geometry in the time-honoured way. Imagine the dividend n to 
be the area of a rectangle one of whose sides is the divisor m. The quotient is
then the maximum number of rows or columns, i.e., m × 1 rectangles, that can be
removed from the whole figure: $n/m := l$ such that $n - l \times (m \times 1) = 0$. Again,
it is tacitly understood that n has to be a multiple of m; fractions are not mentioned
here.
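
The way the operations are stacked on the addition and subtraction of 1 can be sketched as follows. This is a modern paraphrase under stated assumptions, not a transcription of YB: the function names are hypothetical, only non-negative whole numbers are handled, and the two cases on which the chapter is silent (a larger number subtracted from a smaller one, a dividend that is not a multiple of the divisor) are simply rejected.

```python
def succ(x):
    """The fundamental operation: add 1."""
    return x + 1


def pred(x):
    """Its inverse: subtract 1."""
    return x - 1


def add(m, n):
    """n-fold repeated addition of 1 to m."""
    for _ in range(n):
        m = succ(m)
    return m


def subtract(m, n):
    """n-fold repeated subtraction of 1 from m; m < n is left undefined."""
    if n > m:
        raise ValueError("subtracting a larger number is not covered")
    for _ in range(n):
        m = pred(m)
    return m


def multiply(m, n):
    """Addition of m to itself n times."""
    total = 0
    for _ in range(n):
        total = add(total, m)
    return total


def divide(n, m):
    """Count the rows of m unit squares that can be removed from an area n."""
    if m <= 0:
        raise ValueError("the divisor must be positive")
    rows = 0
    while n > 0:
        n = subtract(n, m)  # raises if n is not a multiple of m
        rows = succ(rows)
    return rows
```

Squaring and cubing would be built on `multiply` in the same manner, which is the sense of the passage quoted from [Sarma] 1.3.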
All of this is totally in conformity with the Indian approach to the logic of
arithmetic from much earlier times. In particular, there is no sign of any sympathy
for abstraction as might have been expected 400 years after Bhaskara wrote
Bijaganita. Even by implication, there is no hint that subtraction of and division
by arbitrary positive integers will automatically force the extension of the natural
numbers to include negative integers and fractions. Fractions in their own right
make their entrance soon enough, in chapter 3, where, for example, division by
n/m is carefully shown to be the same as multiplication by m/n. But we have to go
deep into the calculus of the series for π to see the first algebraic statement about
negatives, a citation of Brahmagupta's well known aphorism and its generalisation
to indeterminate numbers (“quantities (rasi) whose numerical values (samkhya) are
unknown”). The terms in the π series are given by a recursive rule and they alternate
in sign, so this belated recall of well-established principles and their easy adaptation
to variables was inescapable. It is certain that there was no fear of negatives, there
had not been any for a long time [11], but they were still, to facilitate computation,
handled with care (*Here, if an ee uate number (rasi) is of negative nature, it 




aid of tokens, [Sarma] 6.8, p. 75). 
ually clear that there was, if not fear, a degree of wariness regarding 


series, the significance of which was well understood by Nilakanthan. 

bed for extracting square and cube E of m 
answers will be whole numbers. Later, in the chapter on 
The fear of the irrational may provide an indirect explanation for another enduring
mystery: the absence of a decimal representation for fractions in Indian
mathematical work. It would have been a natural and undemanding thing to have 
done, as demonstrated convincingly by the Babylonians. Can it be that the avoidance 
of decimal fractions has something to do with the fact that even simple fractions 
can have decimal expressions that are actually, not just potentially as in the case of 
whole numbers, nonterminating? 
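
The point about nonterminating expansions is easily checked by long division in an arbitrary base. The sketch below is only an illustration; the function name `expand_fraction` and the cut-off `max_digits` are hypothetical choices made for the example.

```python
def expand_fraction(p, q, base, max_digits=12):
    """Long division of p/q (with 0 < p < q) in the given base.

    Returns the fractional digits found and a flag saying whether the
    expansion terminated within max_digits places.
    """
    digits = []
    remainder = p
    for _ in range(max_digits):
        if remainder == 0:
            return digits, True
        remainder *= base                 # shift one place to the right
        digits.append(remainder // q)     # next digit of the expansion
        remainder %= q
    return digits, remainder == 0
```

Thus `expand_fraction(1, 3, 10)` keeps producing the digit 3 without end, while `expand_fraction(1, 3, 60)` stops after the single sexagesimal digit 20; a fraction such as 1/7, on the other hand, terminates in neither base.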

There is an irony here. When it came to geometry, there was no way to keep 
away from square roots since one of the two pillars of Indian geometry, the theorem 
of the diagonal (Pythagoras’ theorem; the properties of similar triangles form the 
other pillar) cannot possibly dispense with square roots. In the early part of chapter 
6, YB describes the (infinitely iterative) procedure of getting at the circumference 
from the diameter by starting with a circumscribing square and repeatedly doubling 
the number of sides through circumscribing regular polygons. (It is a good guess 
that Aryabhata found his approximation to π in this way). The procedure is of
course riddled with square roots. It must have rankled. Before starting on the first
steps that will finally lead to the π series, YB announces: “After this, [I] describe
a way of producing the circumference for any desired diameter without having to 
take square roots”. With such undramatic words were his pupils initiated into the 
new-born discipline of calculus by Jyesthadevan. 
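
How thoroughly the doubling procedure depends on square roots can be seen from a short numerical sketch. It is a modern algebraic restatement, assuming a circle of unit radius and using the half-angle identity for the tangent; YB's own steps are geometric and are not reproduced here. The function name `circumference_over_diameter` is hypothetical.

```python
import math


def circumference_over_diameter(doublings):
    """Perimeter of a circumscribing regular polygon divided by the diameter.

    Starts from the circumscribing square of a circle of unit radius and
    doubles the number of sides the requested number of times.
    """
    sides = 4
    half_side = 1.0  # half the side of the circumscribing square
    for _ in range(doublings):
        # Half-angle step: one square root per doubling of the sides.
        half_side = half_side / (math.sqrt(1.0 + half_side * half_side) + 1.0)
        sides *= 2
    # Perimeter is 2 * sides * half_side and the diameter is 2.
    return sides * half_side
```

The value starts at 4 for the square and decreases towards π as the sides are doubled; every step of the way costs one more square root, which is the nuisance the announcement quoted above promises to remove.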


2.4. Origins? 


Let us conclude this bird’s-eye view of the grammar of numbers with a brief return 
to the question of origins and possible influences. The natural, in fact the only,
mathematical candidate that might have served as a model for the decimal place-value
‘language’ (it was a language, orally expressed, with its own grammatical rules) is
the Babylonian sexagesimal system.²⁴ The mathematical apogee of Mesopotamia,
roughly 2000-1600 BCE, was contemporaneous not with the vedic but with the 
earlier Indus valley or Harappan civilisation in its late phase. Some vestiges of west 
Asian civilisations have been found in Indus valley sites, but they are so sparse 
and inconclusive as to be useless for our purpose. The reverse flow is somewhat 
better authenticated, in the form of a number of seals, sealings, etc., unearthed in an 


24The possibility of a Chinese influence is definitely discounted by the profusion of number names 
reflecting an evolved decimal structure in the Rgveda, including in its earliest mandalas, a fact already 
alluded to several times. (A forthcoming article in collaboration with Bhagyashree Bavare will look at 
the evidence in detail). The earliest Indian references to China are related to commercial activities and 
do not predate the 3rd century BCE (in the Arthasastra). Long before that time, not only a perfected 
decimal counting system but also fairly advanced arithmetical procedures using it (the Sulbasutras of
Baudhayana and Apastamba) were well in place. For a quick tour d'horizon of Sino-Indian intellectual 
engagement, which flourished most vigorously in the first millennium CE, see Amartya Sen’s essay 
“China and India” in [24]. 
area spreading from the Arabian coast through the mouth of the Tigris-Euphrates
into modern Syria. As for the Indus culture within its native domains, the ruins
and the excavated artefacts are witness to a high degree of geometrical and metrical
sophistication, extending from the construction of their cities and the varied
symmetric patterns on a fairly extensive sample of pottery, seals etc. [22], to the
very accurate regularity of the weights. The latter in particular persuade us to conclude
that it was a predominantly binary culture (or one based on a low power
of 2), the weights doubling in steps from 1 to 32 units, with weak indications of
a decimal progression as they got heavier. Then there are the signs showing the
supposed numerals 1 to 7 by vertical strokes, giving rise to the speculation already
cited (see footnote 23) that counting was octal.²⁵ All in all, there is little that
indicates a special role for the number 60 in the greater Harappan culture in its
evolved form over its entire duration (of at least a millennium) and extent (from
modern Baluchistan to the Indus river basin, to the river Yamuna to the east and
across Saurashtra). Nor is there any such evidence for the vedic culture²⁶ that succeeded
it in northwest India and then spread quickly into most of the Ganga river
basin.

But it is fair to say that Harappan research is still in an early consolidation phase. 
Despite the vigour that animates the field, going beyond the material remains to a 
deeper insight into the culture has proved to be an elusive quest; we are nowhere 
near getting inside the minds of the Indus people and we will not get much closer 
as long as their writing remains unread. One can always hope that there will be a 
breakthrough on the decipherment front one of these days and that the writing will 
then turn out to contain information about the west Asian civilisations with which 
they had close commercial ties. While waiting for that to happen, what is one to
do? The course that recommends itself is to concentrate on whatever we can learn
from what we do have. There is no question that our understanding of the vedic 
intellectual legacy is much richer and deeper compared to the sketchy and uncertain
picture we have, or are likely to have in the near future, of the Indus culture in its
ts. That rich vein is far from exhausted. From what we know 
it and Panini's grammar and from what we can learn about the 
approach to numbers from even as late a text as YB, it does not seem to
be a far-fetched idea to look for links between the two.²⁷ Indeed, already at the
ODD are striking similarities. Sanskrit organises its syllables 
in a well-ordered sequence, the vowels first and then the pure consonants in five
groups of five each (plus a few outliers which don't fit the pattern), the order of the 
groups depending on the part of the vocalising organs utilised in articulating them, 
progressively from the back of the palate to the lips, see for instance [25]. (There 
is also a well-defined order within each group). The genesis of this organisation 
goes back to vedic times, its final systematisation coinciding roughly with the era 
of Panini [25]. It is easy to see in the pure syllables an analogue of the atomic 
numerals; there are rules for combining them to produce all legitimate sounds of the 
language which are in turn used to formulate rules to combine clusters of sounds or 
‘words’. In the context of phonology, the vowels and the simple consonants are ‘as
if’ prakrti (see footnote 28 below), all other sounds being constructed out of them.
(The ordering of the syllables and the combination rules are of importance in the 
katapayadi mapping of numbers into verbal expressions; for instance, the vowel 
value of a syllable has no numerical significance, just like all but the last sound of a 
compound consonant). 

Perhaps, rather than think of Sanskrit grammar as a possible imperfect model for 
the Indian number system, we should open our minds to the possibility that both are 
parallel manifestations of a particular intellectual mindset, specific to the time and 
the place. Perhaps there was a mathematical Panini, perhaps he even wrote a book 
entitled Dasadhyayi, alas lost forever. 


3. Varieties of Recursion — the Arctangent Series 
3.1. Generalities, the Notion of Samskaram 


The “doctrine of decimal numbers” which gave confidence to Newton in the han- 
dling of infinite series in his early work in calculus had been a constant presence 
in Indian mathematics from at least the vedic era. When the Kerala mathematicians 
found their own route to calculus, two and a half centuries before Newton, they 
were the inheritors of a decimal culture that was seeded two and a half millennia 
earlier and had grown to healthy maturity in the following thousand years or so. 
Though it is not in the nature of Indian mathematical writing to spend much time 
on motivation and history, the influence of decimal numerical thinking is so evident 
in all of it that it is commonly characterised as fundamentally computational. So it 
is not only in the Kerala school’s fearlessness of infinite series (and of the idea of 
infinity itself) but in the totality of their mathematics that we come upon the per- 
vasive undercurrent of recursive construction that is at the core of the place-value 
representation of numbers. 

Recursive construction generally figures in Kerala writing on calculus in the 
guise of a samskaram (refining; occasionally, other synonymous terms are also 
employed). The result of a process of samskaram is something that is samskrtam,
that which is or has been refined. This is also the name of the language in which 
the writing itself was done, Sanskrit, as distinct from prakrtam or Prakrit, that
which is natural.²⁸ There are variations in the details of how mathematical refining
is carried out in different situations, but the general idea in its simplest variant
can be summarised as follows. In a given problem (evaluating a geometrically
defined quantity, solving an algebraic or differential equation, etc.), i) use insight
and ingenuity to make a first guess of the answer, generally incorrect; ii) use the
formulation of the problem itself (i.e., in ignorance of the correct answer) to obtain
an exact expression for the difference between the exact answer and the first guess
that displays the (unknown) exact answer in a practically usable form; and iii)
repeat the process by substituting for the exact answer from the expression obtained
in step ii), which will result in an updated guess displaying, again, the unknown
answer, and so on. Stated like this the procedure may appear both pointless and
vague. Slightly less vaguely, suppose the first guess for the unknown answer $x$ is $x_0$
and the difference $x - x_0$ is a known and easily computable function $f$ of $x$. Then
$x = x_0 + f(x) = x_0 + f(x_0 + f(x)) = x_0 + f(x_0 + f(x_0 + f(x))) = \cdots$. (In
practice it may be necessary to approximate $f$ in some clever way.) The examples
from YB that we are going to be dealing with will illustrate the variations that are
possible within this general framework.
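
The three steps can be sketched in a few lines of code. The example is a hypothetical one chosen for its simplicity and is not claimed to be one of the cases worked out in YB: to find $x = a/b$ when $b = c - d$ and division by $c$ is easy, take the first guess $x_0 = a/c$; the exact difference is then $x - x_0 = x\,d/c$, which still displays the unknown $x$. The names `refine` and `divide_by_refining` are assumptions of the sketch.

```python
def refine(x0, f, steps):
    """Iterate x -> x0 + f(x), starting from the first guess x0."""
    x = x0
    for _ in range(steps):
        # Substitute the current guess for the unknown in the exact
        # expression x = x0 + f(x) to obtain an updated guess.
        x = x0 + f(x)
    return x


def divide_by_refining(a, c, d, steps=60):
    """Approximate a / (c - d) using only division by c (needs |d| < |c|)."""
    x0 = a / c                        # step i): the first guess
    correction = lambda x: x * d / c  # step ii): the exact difference x - x0
    return refine(x0, correction, steps)  # step iii): repeat
```

With $a = 1$, $c = 10$ and $d = 3$ the successive guesses are 0.1, 0.13, 0.139, ... and approach 1/7. The refining converges here because $d/c$ is smaller than 1 in magnitude; with $|d| \ge |c|$ the same formal procedure would produce a series that does not converge.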

But, before that, we may already take note of some of the qualitative features that
emerge from these examples and illustrate how they vary from instance to instance.
First, the procedure may be difficult to carry through beyond one or two iterations
and so may have to be terminated, resulting in an approximation, often characterised
as gross (sthūlam) or fine (suksmam), to the correct answer. In one case where this
happens (in the problem of estimating the truncation error in the π series), YB goes
back and plays around with the initial guess so as to get a better approximation. The
second possibility is that the procedure leads to the exact answer either in a finite
number of steps (finite recursion, the most sophisticated example being kuttakaram
or ‘pulverisation’)²⁹ or by ‘leading back on itself’ (cyclic recursion, as in


ur Sree an the original]", followed by an explanation of 
mere I have called ; atomic numbers to successively higher powers 
on becomes the prakrti for the next one. Tampuran and Ayyar [5] 
general connotation of the term prakrti in the sastra literature 
the putative exact solution of Aryabhata’s difference equation for the sine). The
third and final possibility is the most interesting: that the recursion does not end but 
has a sufficiently regular structure for it to be carried through ad infinitum (infinite 
recursion). The resulting series are the most celebrated fruits of the Kerala calculus, 
but the main emphasis in the present article will not be on the series per se, nor on 
the deeper methodological advance (the infinitesimal method) that led to them, but 
only on the recursive element. 

To illustrate some of the issues specific to infinite recursion, it is enough to refer 
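to the first two examples from YB, the π series

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \cdots$$

and its generalisation to the arctangent series for an arbitrary angle θ lying between
0 and π/4,

$$\theta = \tan\theta - \frac{\tan^3\theta}{3} + \frac{\tan^5\theta}{5} - \cdots$$
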
Infinite series such as these, arrived at by a formal recursive procedure, run the 
risk of not converging to any finite answer. Whether by insight or otherwise, the 
Kerala mathematicians invariably avoid this ever-present pitfall — it is trivially easy 
to produce recursively constructed series which do not converge, as the following 
subsection will illustrate. The question of convergence is a particularly sensitive one 
for the arctangent series since the variable tan θ is not bounded. It is quite obvious
from the paragraph from chapter 6 of YB explaining how to go from the π series to
the arctangent series that its author was aware of this and knew what to do about
it. The first sentence, “Here [I] explain how to find the arc [Rθ] corresponding to
the bhujajya [R sin θ] or kotijya [R cos θ], whichever is smaller” ([Sarma] 6.6),
makes it clear that what is sought is not a formal expansion of θ in powers of
tan θ for the entire range of values of θ (it is enough to consider angles between 0
and π/2 which is what YB does) but rather a convergent expansion in the variable
appropriate to a given subdomain of values of θ, i.e., that the tan θ expansion is
the correct convergent one for θ in the first octant, sin θ < cos θ. The last sentence
of the paragraph confirms that when cos θ < sin θ, the appropriate variable is in
fact cot θ. (The full paragraph is part of section 6.6, to be read with Fig. 6.6 of the
explanatory notes, of [Sarma]). If this is not sufficiently convincing evidence of the 
concern that the series should make sense, we have the whole of the very elaborate 




whole chapter (5) in YB. The solution is a recursive one, involving the use of the Euclidean algorithm 
in a numerical (integral) form. It is very well explained in several places, e.g., [9], the epilogue (B) of 
[Sarma] and most thoroughly, especially in its connection with continued fractions, in the appendix (in 
English) to [TA]. For this reason, and since it has no direct linkage to any other of the recursive tools 
employed by YB, I shall have nothing further to say on the subject here. 
discussion at the end of chapter 6 (sections 6.8, 6.9 and 6.10 of [Sarma]) of methods
of accelerating the rate of convergence and of estimating truncation errors which 
would make no sense unless one knew that the series was convergent. 
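
The choice of variable described above, tan θ in the first octant and cot θ beyond it, can be sketched numerically. The sketch is a modern illustration under stated assumptions: the circle has unit radius, the angle lies in the first quadrant, the series is simply cut off after a fixed number of terms, and the second octant is handled by taking the complement of the arc given by the cot θ series. The function names are hypothetical.

```python
import math


def arctan_series(t, terms=60):
    """Partial sum of t - t^3/3 + t^5/5 - ..., intended for 0 <= t <= 1."""
    total = 0.0
    power = t
    for k in range(terms):
        total += (-1) ** k * power / (2 * k + 1)
        power *= t * t
    return total


def arc_from_sine_cosine(sine, cosine, terms=60):
    """Arc of a first-quadrant angle, expanding in whichever ratio is at most 1."""
    if sine <= cosine:
        # First octant: expand in tan(theta) = sine / cosine.
        return arctan_series(sine / cosine, terms)
    # Second octant: expand in cot(theta) and take the complement of the arc.
    return math.pi / 2 - arctan_series(cosine / sine, terms)
```

Away from the boundary between the octants a few dozen terms already reproduce the angle to high accuracy. At the boundary itself the variable equals 1, the series reduces to the π series and converges very slowly, which is where the acceleration and truncation estimates of sections 6.8 to 6.10 come into their own.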

There is also in YB an instance to illustrate the other side of this question as it were: 
Madhavan's interpolation formulae for the sine and the cosine. They are arrived at
by a novel variant of samskaram dealing with two quantities (sin(θ + δ) − sin θ
and similarly for cos) simultaneously and so provide a very interesting example of
recursive techniques. But, unlike in the case of the power series around 0 for the
same functions, YB treats the application to interpolation briefly, not going beyond
the second order correction and dismissing the subject with a terse remark on how
to improve the accuracy of the approximation by two more orders; it does not
refer to the possibility of repeating the process indefinitely. And that is a good thing
because, if it did, the result would be an infinite series for (sin/cos)(θ + δ) in powers
of δ, a would-be Taylor series which is not the Taylor series because the coefficients
are wrong. 

The other issue, related to this but more philosophical than technical, is one 
which seems to have caused some confusion among scholars. It is this: assuming 
that infinite recursion gives a convergent series in a given problem, did the Kerala 
mathematicians themselves think of such results as a series of successive approx- 
imations only, or as results which are in principle exact when ‘all’ the terms are 
added (“in the limit’ as we would say), but which for practical purposes will have to 
be terminated at some point? Several historians have subscribed to the first view in 
recent times, sometimes explicitly. Partly, the confusion is due to the excessive em- 
phasis put on these series in the early writing in English (and, often, even today) on 
the Kerala work, to the detriment of the driving philosophy behind it, that of turning 
Aryabhata's approximation methods into a technique for getting exact answers by 
going infinitesimal.³⁰ But a reading of YB leaves no room for doubt that this is a
mistaken view. Chapters 6 and 7 are liberally strewn with remarks which either say
specifically that the results expressed by the appropriate series are accurate or exact
where these issues have already been confronted and settled). The conclusion is
reinforced by YB's silence in an apparently similar context, the interpolation formula
referred to above; it is as though the power series and the numerical series for π
were recognised to be exact in the limit but the interpolation series would not be
and was merely a means of approximation.³¹ In fact, evidence for the awareness of
the distinction between an exact state of affairs and approximations to it forced by 
the demands of practical utility is already present in the very beginning of YB, in 
connection with the infinitude of numbers and the place-value representation of any 
arbitrary (finite) number. The sentence I have quoted earlier on the impossibility of 
knowing and naming numbers is followed immediately by: “Therefore, for the purpose
of making use of them (or working with them, the Sanskrit/Malayalam word
used is vyavaharam; “them” means the infinitude of integers), think as follows. The
first place is for the numbers from one to nine. Then the place of all these multiplied
by ten [is] the second. Let that be to the left. The place of one, the place of ten, and so
on, are their names. That is the nature of numbers” ([Sarma] 1.2). This is as good a
characterisation of the distinction between the (recursively defined) axiomatic and 
the (recursively constructed) arithmetical view of natural numbers as any we can 
think of. 


3.2. The π Series and the Arctangent Series as Rectification


There are now available in English at least two detailed and faithful annotations 
of the yukti as described in YB of these two series, Sarasvati Amma's book [6] 
which has been around for a while now and the very recent explanatory notes [RSS] 
appended to Sarma's translation. With the assurance that the reader can fall back 
on these references for the details not discussed here, we can focus on just the 
recursive elements in the contents of chapter 6 of YB in which these series are 
exhaustively treated. They are abundantly supplied with diagrams, generally based 
on the Tampuran-Ayyar critical edition, and that is also a help: on the occasions 
on which we need to look at the specifics of the geometry, I can simply refer the 
reader to the appropriate figures from [RSS] rather than reproduce them.
Also, since the order in which YB obtains these results — first e 
for m which is then generalised by a use of the properti i 
has no special interest from the recursive viewpoint, iti 
inconvenient) to stick to that order. iA 
These series are answers to what in Europe 
problems: one is required to find the length o 
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the Cartesian way or as the trajectory of a point under a prescribed motion. (In the 
present case, the curve is the circle of unit radius). The general approach used in YB 
to get to the answer is a precursor of what came to be called Riemann integration: 
divide the relevant interval of the x-axis (the independent variable, in our case tan θ) 
into n equal³² segments, estimate the length of the curve corresponding to one 
segment of x by linearising it locally (here, by approximating the corresponding 
segment of arc by half the chord of twice the arc, i.e., by replacing the angle $\delta_i$ 
subtended by the ith segment at the centre by $\sin\delta_i$), add up the n contributions 
and, finally, let n increase without bound ('in the limit $n \to \infty$'). The limit makes 
the linear approximation exact, $\sin\delta_i \to \delta_i$ as $\delta_i \to 0$ and, moreover, allows the 
systematic neglect of terms of order 1/n in the sum as n is taken to infinity. The 
linearisation is of course the heart of the matter, the critical conceptual step beyond 
Aryabhata's finitistic method. There are only two ways in which the procedure differs 
from what we teach today in a course in calculus, one minor and the other not so. 
The minor point is that the modern positive x-axis is, in the Kerala (and Indian) 
convention, the direction east which, in a diagram on sand spread on the floor, 
points away (and in the very rare diagrams on palm leaf, up), and the first quadrant 
is the E-S quadrant, so reflecting a Kerala circle through the N-S axis produces 
our conventional graph of the equation of the circle $x^2 + y^2 = 1$ (see Figs. 6.3 and 
6.5 of [RSS]). The more interesting difference is that the work is carried forward 
as far as it will go before the $n \to \infty$ limit is taken, resulting in the uncovering of 
some striking, essentially arithmetical/algebraic, identities. The simplest of them 
is a finite counterpart of the formula for integration by parts (a simple special 
case of the Abel resummation formula, see section 3.4 below), a key step in the 
[…]ctive evaluation of the integrals of positive integral powers. I draw these obvious 
[…] 

This is an exact result valid for any n. In the first instance of the use of the $n \to \infty$ 
limit, YB replaces $1/(d_{i-1}d_i)$ by $(1/2)(1/d_{i-1}^2 + 1/d_i^2)$. This is one of the very few steps 
for which YB provides no justification, but it is easy to see that the two quantities 
are equal to the first order in $(d_i - d_{i-1})/d_{i-1}$. Thus $S_n$ is approximated, first by 


$$S'_n = \frac{1}{n}\sum_{i=1}^{n}\frac{1}{2}\left(\frac{1}{d_{i-1}^2} + \frac{1}{d_i^2}\right)$$

with $d_0 = 1$ and $d_n^2 = 2$ (normalised radius), and then by 
[…] 
which is ignored as $n \to \infty$. This last step is given a justification in the words, 
“As the segment of the side [the tangent] becomes small[er], the one-fourth part 
[$-1/4n$] can be discarded” ([Sarma] 6.3.2), leaving no room for doubt about YB's 
appreciation of the power of the limiting operation: $S''_n$ will tend to an eighth part of 
the circumference ($= \pi/4$) and its expression in terms of the diagonals will tend to 
the π series. 
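
The chain of approximations described in this section can be checked numerically. The following is a minimal sketch, not a transcription of YB's procedure. It assumes the unit-radius setup in which the tangent is cut into n equal segments, so that $d_i^2 = 1 + i^2/n^2$, and it takes the half-chord of the ith arc segment to be $(1/n)/(d_{i-1}d_i)$, which follows from elementary trigonometry but is not legible in this part of the text. The function name `rectification_sums` is hypothetical.

```python
import math

def rectification_sums(n):
    """Return (S_n, S'_n, S''_n) for a unit circle with the tangent cut into n segments."""
    # d2[i] is the squared hypotenuse d_i^2 = 1 + (i/n)^2 (assumed normalisation).
    d2 = [1.0 + (i / n) ** 2 for i in range(n + 1)]
    # S_n: sum of the half-chords sin(delta_i) = (1/n) / (d_{i-1} d_i).
    s_n = sum((1.0 / n) / math.sqrt(d2[i - 1] * d2[i]) for i in range(1, n + 1))
    # S'_n: replace 1/(d_{i-1} d_i) by the mean of 1/d_{i-1}^2 and 1/d_i^2.
    s_prime = sum(0.5 * (1.0 / d2[i - 1] + 1.0 / d2[i]) for i in range(1, n + 1)) / n
    # S''_n: keep only 1/d_i^2; it differs from S'_n by exactly 1/(4n).
    s_double_prime = sum(1.0 / d2[i] for i in range(1, n + 1)) / n
    return s_n, s_prime, s_double_prime

if __name__ == "__main__":
    for n in (10, 100, 1000):
        s_n, s_p, s_pp = rectification_sums(n)
        # Columns: n, S_n, S'_n - S''_n, 1/(4n), pi/4 - S''_n
        print(n, s_n, s_p - s_pp, 1 / (4 * n), math.pi / 4 - s_pp)
```

The difference $S'_n - S''_n$ telescopes to $(1/2n)(1/d_0^2 - 1/d_n^2) = 1/4n$, which is the "one-fourth part" that the quoted passage discards.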


3.3. The Elementary Recursive Identity 

[…] valid exactly for all k = 1, 2, ⋯. The kth samskāram will consist simply of putting 
a = b in the last factor of the last term. 

It is obvious that the utility of the procedure depends on what kind of a handle 
we can get on the quantity a − b. In the problem of the π series, $a = d_i^2$ is the square 
of the hypotenuse and b = 1 a square of a side of a right triangle. So a − b is 
the square of the third side, $a - b = i^2/n^2$; the kth recursive identity for $1/d_i^2$ then 
[…] 
paragraph: “In this way, if [we] divide all [the terms] by the multiplier ((radius)² (= 
1 in our normalisation); i.e., if $1/d_i^2$ on the right is systematically replaced by the 
whole of the right side), the sequence of corrections (śodhyaparampara) will not 
end until [we] divide by the divisor (i.e., $d_i^2$). If [we] do not divide by the divisor, the 
sequence of corrections will not end. [All we can do is to] discard it (the correction 
term) when it becomes very small” ([Sarma] 6.3.3). 

Apart from a precise formulation of how an infinite series is recursively gen- 
erated — it will serve as a template for more complicated series later — there is 
enough in this passage to clarify several issues. First is the virtue of displaying 
the unknown answer in a way enabling repeated substitutive recursion ad infini- 
tum. The demands of practical utility (vyavaharam) are met by the suggestion to 
terminate the sequence when it is deemed sufficiently accurate. But what stands 
out most is the dramatically repeated message that there is no need to end the 
sequence, that the exact measure of the circumference, in other words the value of π, 
can only be given by an infinite series. The message is reinforced in the opening 
lines of the following paragraph (see section 6.3.4 of [Sarma]) describing the even 
and odd terms with their alternating signs whose sum, in the limit $n \to \infty$, will 
become the exact circumference.³⁴ And it is difficult to read this passage without 
calling to mind how Nilakanthan interpreted Aryabhata's description of his 
own fractional value for π as āsanna (approximate), that it really means that π is 
irrational.³⁵ 

All this leads directly to the question of convergence, particularly easy to deal 
with for the arctangent series because the recursive identity, infinitely iterated, 
results in a geometric series: 

$$\frac{1}{a} = \frac{1}{b} - \frac{a-b}{b^2} + \frac{(a-b)^2}{b^3} - \cdots$$

with −(a − b)/b as the common ratio. So, while the identity is trivially true at every 
stage of iteration for any values of a and b, the infinite series converges if and only 
if a − b < b (in the present instance, i < n, which is guaranteed; what does not work 
in the second octant is the geometric construction, as made clear by YB). Let us 
recall that it is in dealing with similar situations requiring the integration of functions 
which are not powers that Newton took recourse to formal binomial expansions. The 
binomial series for the inverse first power, which too is a geometric series, coincides 
with the iterative series, but a binomial expansion for fractional or general negative 
integral exponents seems to have had no use for the Kerala mathematicians. But 


the present one, the context helps to remove any ambiguity. The Malayalam-to-English glossary at the 
end of [TA] translates sodhyaphalam as "correction to be applied to a result". 

³⁴ It is this kind of insight that we are deprived of in the Sanskrit verse texts, see footnote 30. 

³⁵ I have suggested elsewhere [16] that the irrationality of π may have acted as an immediate motivator 
for the development of the π series. 

if the need had arisen, they probably had other effective techniques to hand, for 
instance a form of iterative approximation that appears to have been used already 
in the Śulbasūtras and subsequently in the Bakhshali manuscript to extract square 
roots. There are interesting recursive issues relating to these formulae which I take 
up briefly in section 4.6 where we will see that the Śulbasūtra formula may well be 
the first recorded instance of the use, in an arithmetical computation, of the general 
method of samskāram. 
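
The behaviour of the elementary recursive identity can be illustrated with exact rational arithmetic. The sketch below assumes the identity in the form $1/a = 1/b - ((a-b)/b)(1/a)$, substituted into itself k times, which is the reading consistent with the geometric series and the common ratio −(a − b)/b quoted above. The function name `iterate_identity` is hypothetical.

```python
from fractions import Fraction

def iterate_identity(a, b, k):
    """Iterate 1/a = 1/b - ((a - b)/b) * (1/a) k times.

    Returns (partial, remainder) with partial + remainder == 1/a exactly:
    partial is the first k terms of the geometric series and remainder is
    the last term, which still carries the unknown factor 1/a.
    """
    a, b = Fraction(a), Fraction(b)
    ratio = -(a - b) / b                      # common ratio of the series
    partial = sum(ratio ** m / b for m in range(k))
    remainder = ratio ** k / a
    return partial, remainder

if __name__ == "__main__":
    # a = d_i^2 = 1 + i^2/n^2 with i/n = 1/2, b = 1: the remainder shrinks.
    print(iterate_identity(Fraction(5, 4), 1, 10))
    # a - b > b: the identity is still exact, but the remainder grows.
    print(iterate_identity(5, 2, 10))
```

The identity holds at every stage for any positive a and b; only the decision to discard the remainder depends on a − b < b.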


3.4. Recursion as Proof, Induction 


Having shown that $S''_n - S_n \to 0$ as $n \to \infty$, YB proceeds to demonstrate how 
$S = S'' := \lim_{n\to\infty} S''_n$ can be evaluated. This is done by taking the limit of each 
term in the sum 

$$S''_n = I_{n,0} - I_{n,2} + I_{n,4} - \cdots$$

$$I_{n,k} := \frac{1}{n^{k+1}}\sum_{i=1}^{n} i^k$$

[…] 
for k = 0, 2, ⋯. The computation of $I_k := \lim_{n\to\infty} I_{n,k}$, in terms of which the 
[…] in the book. The contents are worthy […] 
[…]mpasses the first results that can be considered […] 
[…]ns which had never been posed before, namely […] 
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a prefatory sentence: “Even though it is not useful here, [I am] describing also 
integrals³⁶ of equals multiplied three, five, etc. times as they occur in the midst of 
those which are useful” ([Sarma] 6.4), YB first evaluates $I_1$ from $I_0$ (= 1) and $I_2$ 
from $I_1$ in detail, then $I_3$ from $I_2$ and $I_4$ from $I_3$ progressively more briefly, “by 
virtue of the reasoning given earlier”. A final passage gives the general rule: “To 
make integrals of higher and higher powers, multiply the particular integral by the 
radius and remove from it itself divided by the number which is one greater (for 
unit radius, this amounts to $I_{k+1} = I_k - I_k/(k+2)$)” ([Sarma] 6.4.4). However, in 
order to highlight the recursive content, I compress all of this³⁷ into a description of 
the general case of evaluating $I_{k+1}$ from $I_k$, following faithfully YB's own steps in 
going from $I_0$ to $I_1$ to $I_2$. 

The prior announcement about the need to consider odd values of k, “though 
not useful here", already offers a hint that it was an essential step in the proof and 
that the proof will turn out to be inductive. The details bear this out. True to its 
invariable practice, YB carries out as much of the procedure as is practicable for 
finite n, before taking the limit. In doing this it is a simplification to drop the powers 
of n from the definition of the integrals (samkalitam) and to focus on the sums of 
powers: 

$$A_{n,k} := \sum_{i=1}^{n} i^k.$$

The first step in evaluating $A_{n,k}$ is none other than the first stage in a general process 
of samskāram as I have described it in section 3.1. One makes an insightful first 
guess $B_{n,k}$ by replacing one factor of i in each term by n: 

$$B_{n,k} := n\sum_{i=1}^{n} i^{k-1} = nA_{n,k-1}.$$

The ‘error’ is 

$$B_{n,k} - A_{n,k} = \sum_{i=1}^{n-1} (n-i)\, i^{k-1}.$$


³⁶ The word employed is samkalitam, which is used in this part of the book for both $I_{n,k}$ and the 
limiting quantities $I_k$; see footnote 33. 

³⁷ The thought naturally arises that Jyesthadevan was aware that this was an unprecedented sort of 
yukti and so was presenting his reasoning with the care and attention to detail that it called for. On the 
value placed by the Kerala masters on consensus as a means of validating a yukti, on convincing their 
peers and disciples of its correctness, see [3]. 
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A simple reordering of the sum on the right side now leads to the identity 


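$$\sum_{i=1}^{n-1}(n-i)\,i^{k-1} = \sum_{i=1}^{n-1}\sum_{j=1}^{i} j^{k-1} = \sum_{i=1}^{n-1} A_{i,k-1}$$

so that the difference between the exact value and the first guess can be expressed 
as 

$$B_{n,k} - A_{n,k} = \sum_{i=1}^{n-1} A_{i,k-1}.$$

So 

$$A_{n,k} = nA_{n,k-1} - \sum_{i=1}^{n-1} A_{i,k-1}.$$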


This is the key relation. In contrast to the application of the elementary recursive 
identity, however, $B_{n,k} = nA_{n,k-1}$ becomes a ‘known quantity’ only after the sum 
of the (k − 1)th powers is computed. Moreover, the difference does not display $A_{n,k}$ 
itself in a nice form amenable to a simple iteration but only as a linear combination of 
sums of the (k − 1)th powers of numbers up to n − 1. A simple iterative computation 
[of $A_{n,k}$] and hence of $I_{n,k}$ and $I_k$ is therefore out of the question. But it opens 
[the d]oor for an inductive evaluation since it allows $I_k$ to be related eventually to 
[…]th the case […] k as a modern inductive proof […] 
[…] < k + 1. […] 

For $n \to \infty$, $A_{n-1,k+1}$ tends to $A_{n,k+1}$; so putting it all together, we arrive at 

$$\left(1 + \frac{1}{k+1}\right) A_{n,k+1} = \frac{n^{k+2}}{k+1}.$$

Or, 

$$A_{n,k+1} = \frac{n^{k+2}}{k+2}.$$

By restoring the powers of n in the definition of $I_{n,k}$, it is easy to rewrite this in the 
form $I_k = kI_{k-1}/(k+1)$, i.e., $I_k = 1/(k+1)$. 
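
The key relation and its limiting consequence lend themselves to a direct check. The sketch below is an illustration only: it verifies $A_{n,k} = nA_{n,k-1} - \sum_{i=1}^{n-1} A_{i,k-1}$ exactly for small n and k, and shows the normalised sum approaching 1/(k + 1). It assumes the normalisation $I_{n,k} = A_{n,k}/n^{k+1}$, which is what "restoring the powers of n" is taken to mean here. All function names are hypothetical.

```python
from fractions import Fraction

def power_sum(n, k):
    """A_{n,k}: the sum of the kth powers of 1, 2, ..., n."""
    return sum(i ** k for i in range(1, n + 1))

def power_sum_via_key_relation(n, k):
    """A_{n,k} computed as n*A_{n,k-1} minus the sum of A_{i,k-1} for i < n."""
    first_guess = n * power_sum(n, k - 1)                     # B_{n,k}
    error = sum(power_sum(i, k - 1) for i in range(1, n))     # B_{n,k} - A_{n,k}
    return first_guess - error

def samkalitam(n, k):
    """I_{n,k} = A_{n,k} / n^(k+1) (assumed normalisation)."""
    return Fraction(power_sum(n, k), n ** (k + 1))

if __name__ == "__main__":
    for k in range(1, 5):
        print(k, float(samkalitam(2000, k)), 1 / (k + 1))
```

The relation is exact for every finite n; the value 1/(k + 1) appears only in the limit.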

Once again, there are methodological insights to be drawn from this first-ever 
evaluation of integrals of arbitrary positive integral powers. To begin with, there is 
the reliance on an enlightened first guess as an almost automatic initial step and on 
its subsequent refining. But instead of leading to a simple iteration, the purpose it 
serves here is as the starting point of a procedure that ends in relating a quantity 
indexed by k, $I_k$, to $I_{k-1}$, in other words a genuinely inductive procedure. We have a 
hint that this is how the calculation of $I_k$ was meant to go in the deliberate foregoing 
of the use of the known values of $I_{n,k}$ to evaluate $I_k$ for k up to 3 (even the sum $A_{n,1}$ of 
the arithmetic series of natural numbers is not cited); the yukti is designed to cover 
all k as the passages cited earlier suggest. The hint is reinforced by the terminology. 
The integral $I_1$ is called mūlasamkalitam, the root from which the samkalitams of 
all higher powers are generated recursively (except in one place where it is referred 
to as kevalasamkalitam, simple integral, to distinguish it from multiple integrals). 
Related to this is the painstakingly exhaustive exposition of the entire computation, 
letting the student/reader see at every step that the method is logically rigorous: 
there is no guesswork here.³⁹ 

A final insight concerns the reordering (or resummation) identity which I write, 
generalising slightly its form as used in the study of $A_{n,k}$, as 

$$\sum_{i=1}^{n-1} (n-i)\, f_i = \sum_{i=1}^{n-1}\sum_{j=1}^{i} f_j$$


³⁹ Compare Newton on the evaluation of the coefficients in the sine series: “Let it be noted here, by 
the way, that when you know 5 or 6 terms of those roots you will for the most part be able to prolong 
them at will by observing analogies” (De Analysi per Aequationes Numero Terminorum Infinitas in 
[18] Vol. II, p. 206). In one of his letters to Oldenburg (for onward transmission to Leibniz), the Epistola 
Posterior [19] Vol. II, p. 130, Newton writes about how he “first chanced upon” the general binomial 
series as an extension of the results of Wallis by means of “intercalation” and “interpolation”. On the 
question of the nature and role of proof in Indian mathematics, the reader may find it instructive to read 
Srinivas’s essay in [Sarma] alongside Roddam Narasimha [3] cited earlier. 

for arbitrary (say real) coefficients $f_i$. (Its elementary proof is the same as in the 
special case $f_i = i^{k-1}$.) One recognises it for what it is by passing to the limit, 
rewriting it as an identity for integrals 

$$\int_0^x (x - y)\, f(y)\,dy = \int_0^x dy \int_0^y f(z)\,dz$$

of real functions f. To prove it the modern way, define 

$$g(x) := \int_0^x f(y)\,dy$$

so that, by the fundamental theorem, f(x) = dg(x)/dx. This enables us to integrate 
the second term on the left by parts. The left side is then 

$$x\,g(x) - \int_0^x y\,\frac{dg(y)}{dy}\,dy = x\,g(x) - x\,g(x) + \int_0^x g(y)\,dy = \int_0^x dy \int_0^y f(z)\,dz$$

which is the right side.⁴⁰ 
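
The finite resummation identity holds for arbitrary coefficients, and this is easy to confirm by brute force. The sketch below is illustrative only; the list `f` stands for the coefficients $f_1, \ldots, f_{n-1}$, and the function names `lhs` and `rhs` are hypothetical.

```python
from itertools import accumulate
import random

def lhs(f):
    """Sum over i = 1..n-1 of (n - i) * f_i, where f = [f_1, ..., f_{n-1}]."""
    n = len(f) + 1
    return sum((n - i) * f_i for i, f_i in enumerate(f, start=1))

def rhs(f):
    """Sum over i = 1..n-1 of (f_1 + ... + f_i): the sum of the running partial sums."""
    return sum(accumulate(f))

if __name__ == "__main__":
    random.seed(0)
    f = [random.randint(-50, 50) for _ in range(30)]
    print(lhs(f), rhs(f))
```

Each $f_j$ is counted once for every i from j to n − 1, that is n − j times, on both sides; this counting argument is the whole of the "elementary proof".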



3.5. Trial and Error 


Having found an exact expression for π/4 as an infinite series of alternating 
reciprocals of odd integers, YB concludes chapter 6 with a section which Tampuran and 
Ayyar title accurately “Refinements needed to lighten (or shorten) the work of […] 

[…]-and-error procedure of samskāram leading to two different […] 
[…]e, faster-converging exact infinite series or approximate but […] 

[…] that the finite version of the fundamental theorem is self-[…] 
and is never mentioned in any form in YB. This is an […] 

[…]ya-samskāra: Final correction terms”. The […] 
obtained by dividing by higher and […] 
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essential to look for methods of reducing the computational burden, but this is one 
motivation that is easy to guess. The π series is unfit for vyavaharam, on account of 
its slow convergence: the first 56 terms add up [TA] to a value which is wrong in the 
third significant figure, much worse as an approximation than Aryabhata's āsanna 
rational value which is correct to the fifth significant figure. For an endeavour that 
had its origin in dissatisfaction with rational approximations to π, this must have 
been galling. 
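
The slowness of the convergence is easily reproduced. The sketch below simply sums the first 56 terms of the series in floating point and compares the result with π and with the rational value 3.1416 quoted later in the text; the function name `pi_from_series` is hypothetical.

```python
import math

def pi_from_series(terms):
    """4 * (1 - 1/3 + 1/5 - ...) truncated after the given number of terms."""
    return 4.0 * sum((-1) ** i / (2 * i + 1) for i in range(terms))

if __name__ == "__main__":
    approx = pi_from_series(56)
    print(f"56 terms: {approx:.6f}, error {math.pi - approx:.6f}")
    print(f"3.1416:   error {3.1416 - math.pi:.6f}")
```

The truncated sum agrees with π only in its first two significant figures, while 3.1416 is off by less than one part in 100,000.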

In every other respect, however, this section is completely out of character. For 
the first and only time in the book, it has a good pedagogical discussion of mat- 
ters that can be considered genuinely algebraic, namely operations on polynomials 
and rational functions. Since that discussion, fleeting as it is, is modelled on arith- 
metic with decimal numbers, it will be given its due briefly in section 5.1. More 
remarkably, after explaining at length the ‘philosophy’ of an ingenious and novel 
semi-empirical technique of samskaram, YB contents itself with listing several re- 
sults the technique is said to lead to by quoting (Sanskrit verses) from TS, without 
proof or any sort of justification — there is no yukti in these passages and little 
bhasa. 

The conspicuous absence of yukti has encouraged some modern commentators 
to attempt to supply one, and some of the work in that direction has led to an 
intriguing situation. For instance, C. T. Rajagopal (one of the pioneer scholars of 
YB) and M. S. Rangachari have pointed out [26] that there is a beautifully recursive 
procedure to evaluate the truncation error exactly as an infinite continued fraction 
whose first few approximants coincide precisely with YB's recipés. So, did 
Madhavan or Nilakanthan use such methods to arrive at their spectacularly effective 
results,⁴² which YB for reasons unknown decided not to elaborate? Within this 
mystery lies another riddle. Indian mathematics as a whole seems curiously unwilling 
to employ continued fraction expansions, a recursive device par excellence, 
especially as generated by the Euclidean algorithm. (Its rudimentary use in kuttaka 
seems to be the exception that proves the rule.) As of now, there is no satisfactory 
explanation; how these approximations were arrived at remains something of an 
enigma.⁴³ 


The philosophy of the refining process is, briefly, as follows. YB first expresses 
the remainder after summing up to the ith term (with denominator 2i − 1 =: j; so 
j runs through the positive odd integers) as $1/r_j$ with sign opposite to that of the 
last summand ($r_j$ is therefore positive for all j). Then the remainder after truncation 


⁴² The third and final (as far as YB is concerned) approximant for the correction after 56 terms 
improves the accuracy of π from the second to the eleventh significant figure [TA]. 

⁴³ Rajagopal and Rangachari are, very properly, cautious on this question: they preface their continued 
fraction solution of the equation satisfied by the remainder with the sentence, “Let us make some 
observations which (in their main part) go beyond the evidence of the surviving Keralese texts […] 
perhaps not irrelevant” [26]. 
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one term earlier is given by the recursive equation 


$$\frac{1}{r_{j-2}} = \frac{1}{j} - \frac{1}{r_j}.$$


The basic idea is to think of the sum of the first (j + 1)/2 terms of the π series 
for a chosen j as the first guess in a process of samskāram and the remainder $1/r_j$ 
as the correction resulting from the would-be samskāram. The equation above for 
the remainders (the error equation) is the condition for the samskāram to give the 
exact value for the remainder and the aim is to use it as a guide to arrive at a ‘good’ 
approximation. After recognising that an exact solution (as a finitely expressed 
function of j) is probably impossible, the discussion proceeds to make a patently 
absurd first guess, but now for $r_j$, namely $r_{j-2} = r_j = 2j$, and rejects it. The reason 
given, in a first explicitly functional piece of argument (the notion of a function in 
a geometric setting is implicit in the generalisation of the π series to the arctangent 
series, but here there is no longer any geometry to fall back upon), is that the 
remainder after the j-term must depend on j “in the same manner” as the remainder 
after the (j − 2)-term depends on j − 2, i.e., that r must be a (non-constant) function 
of j. So YB now offers a revised, functionally consistent guess, $r_j = 2(j + 1) + x$, 
$r_{j-2} = 2(j - 1) + x$, which has the property that $r_j - r_{j-2} = 4$, independently of 
j and x. This of course does not solve the error equation but only an approximate 
version of it, 


$$\frac{1}{r_{j-2}} + \frac{1}{r_j} = \frac{1}{j - 1/j},$$


[…]d approximation for j ≫ 1/j; for truncation after 56 terms, for […] 
order of 100 and the accuracy is roughly 1 part in 10,000. (YB 
[…]omputes the inaccuracy as the difference $1/r_{j-2} + 1/r_j - 1/j$.) 
[…]en bett[er] approximations is now carried further. The variant 
[…] j − 3 (keeping $r_j - r_{j-2} = 4$; no reason for doing so is given) 
[…] agai[n] rejected since the inaccuracy is now one order in j higher 
[…] (the reference is to a place-value notation with variable 
[…] of j, see section 5.1). At this point, the account 
[…] [d]ifference between $r_j$ and $r_{j-2}$ and suggests 
[…] $r_{j-2} = 2j - 2 + x/(2j - 2)$, 
[…] place in j” (“as said by the 
[…]). The approximations for 
[…] [TA] and, in English, 
[…] 

samskāram for the remainders into a reordered infinite series for π by substituting 
the chosen correction terms for each of the terms in the original series, i.e., 
replacing 1/j by $1/r_{j-2} + 1/r_j$, resulting in 


[…] 

If the $r_j$ satisfy the error equation, this is an exact but logically […] 
$\pi/4 = 1 - 1/r_1$; it is just the definition of $r_1$. It becomes interesting only if the $r_j$ 
are approximate and do not satisfy the equation. But then it is no longer clear that the 
series sum up correctly to the value π/4, though one may suspect, correctly as we 
know now, that they will if the $r_j$ are sufficiently fine (sūkṣma) approximations. 
The series resulting from various choices for $r_j$ (see [TA], [26] or [RSS]) are 
remarkable and some of them have been linked, via continued fractions, to certain 
results of Euler and of Ramanujan [26]. 
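
The trial-and-error logic of this section can be made concrete with exact fractions. The sketch below is an illustration under stated assumptions, not YB's own computation. It takes the guesses in the form $r_j = 2(j+1) + x/(2(j+1))$, which matches the legible fragment $r_{j-2} = 2j - 2 + x/(2j - 2)$. Here x = 0 is the simplest non-constant guess, and x = 4 is used as the refined value; that value is an assumption taken from the classical Kerala correction term and is not legible in this part of the text. All function names are hypothetical.

```python
from fractions import Fraction
import math

def r_guess(j, x=0):
    """Trial divisor r_j = 2(j + 1) + x / (2(j + 1)); x = 0 is the simplest guess."""
    base = 2 * (j + 1)
    return base + Fraction(x, base)

def inaccuracy(j, x=0):
    """1/r_{j-2} + 1/r_j - 1/j: how far a guess is from solving the error equation."""
    return 1 / r_guess(j - 2, x) + 1 / r_guess(j, x) - Fraction(1, j)

def corrected_pi(terms, x=0):
    """4 * (partial sum of the series plus the correction 1/r_j with the proper sign)."""
    partial = sum(Fraction((-1) ** i, 2 * i + 1) for i in range(terms))
    j = 2 * terms - 1                      # denominator of the last summand
    sign = 1 if terms % 2 == 0 else -1     # opposite to the sign of the last summand
    return float(4 * (partial + sign / r_guess(j, x)))

if __name__ == "__main__":
    j = 111                                # last denominator after 56 terms
    print(float(inaccuracy(j, 0)), float(inaccuracy(j, 4)))
    print(math.pi - corrected_pi(56, 0), math.pi - corrected_pi(56, 4))
```

With x = 0 the inaccuracy is exactly $1/(j(j^2 - 1))$, of order $1/j^3$. With the assumed x = 4 it becomes $-4/(j(j^4 + 4))$, of order $1/j^5$, and the corrected sum improves correspondingly.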

From the recursive perspective, this approach is an almost perfect illustration of 
the general philosophy of samskāram: one begins with an identity, namely the series 
for π in terms of $r_j$, then approximates each $r_j$ in such a way that the errors are 
always of order higher than j and will (hopefully, since the procedure is very formal) 
cancel out in the end. Indeed, by the time chapter 6 is brought to a conclusion, we 
have been exposed to a whole sequence of samskārams, nested one within another, 
of varying degrees of sūkṣmatā. The concluding passage is yet another verse from 
TS, containing yet another approximation, finer by one degree in j (the third 
approximant in the continued fraction expansion [26] of $1/r_j$), leaving the question 
of where it all really came from tantalisingly open. 


4. Varieties of Recursion: the Sine/Cosine Series 
4.1. From Aryabhata to Madhavan 


If the π series and the arctangent series are the log[…] 
improve upon Aryabhata's approximate ratio […] 
series are, in the same measure, the final goal […] 
table. Already in AB, the two problems are […] 
precise value for π, a table of sines accurat[…] 
made, no matter how finely the circle […] 
general idea and the recipé for the con[…] 
11 and 12) are immediately precede[…] 
Aryabhata’s level of preci[…] 

328 P. P. Divakaran 


The surprise then is that the breakthrough took so long. The two Bhaskaras, 
both devoted Aryabhata followers, seem to have missed the deeper meaning of 
Ganitapada 11 instructing the listener (or reader) to divide the circumference so 
as to make it possible to find as many sines of equal arcs as one likes (yathesta). 
Whatever the reason for this oversight, and that is another story, it took until the 
advent of Madhavan for Aryabhata’s vision to be finally realised. 

The accurate/exact evaluation of π having been dealt with in chapter 6, the first 
half of chapter 7 of YB (up to and including section 7.8 of [Sarma]) is concerned 
wholly with the circle of ideas pertaining to the accurate/exact evaluation of sin θ 
for an arbitrary value of θ. Quite remarkably, the first topic addressed is the original 
Aryabhatan numerical table of sine differences (of limited accuracy, necessarily) 
for values of θ = mπ/48, m = 0, 1, ⋯, 24, covering the first quadrant. (I shall 
use ε to denote the angle π/48, $s_m$ for sin mε, $\delta s_m$ for the difference $s_m - s_{m-1}$ and 
$\delta^2 s_m$ for the second difference $\delta s_{m+1} - \delta s_m = s_{m+1} + s_{m-1} - 2s_m$.) One might have 
thought that this was wasted labour for the author, knowing as he did that an infinitely 
accurate procedure which worked for any angle was soon going to follow. Apart 
from showing reverence for knowledge from earlier times (“pūrvaśāstra”, meaning 
really AB), and familiarising the reader with the algebraic properties of differences 
and second differences of functions in a familiar situation, this may have had a 
practical motivation: the computation of the table using Aryabhata's formula or the 
equivalent procedure given in YB takes much less effort than working with power 
series. In fact the section of YB containing the yukti behind the table is followed 
by the description of Madhavan's interpolation formulae for sines and cosines of 
angles lying between two tabulated angles making it clear that, for purposes of 
astronomical calculations, there was no need to go any further and also that the sine 
table was an essential first step. We are once again made aware that the development 
of the power series owed less to practical needs than to the attraction of a problem 
that was there and that could be solved. 
In any case, from the point of view of this article, the interest is in the fact that all 
three phases of that enquiry, tabulation, interpolation and power series expansion, 
are addressed by recursive methods of increasing subtlety. 


4.2. Making the Sine Table 


One point of departure for Aryabhata's sine table (the expression will stand for a 
table of either the sine differences or the sines unless explicitly distinguished) is the 
value of sin π/6; this is attested by (the second line of) Ganita 9, to the effect that 
the side of a regular hexagon is equal to the radius of its circumscribing circle. Thus 
the Ganita stanzas 9 to 12 take us along one logical sequence of steps leading to the 
table, beginning with sin π/6 = 1/2, through the approximate value of π (3.1416) 
and the somewhat enigmatic (at first reading) verse 11 instructing us to divide the 
circumference into as many equal arcs as we please, and ending with the formula for 
$\delta s_m$. In view of the content of verse 11, it is pertinent to wonder why this particular 
subdivision was chosen and why it stopped at π/48. As often suggested (first by 
Playfair [20] on the basis of his acquaintance with Sūryasiddhānta), it would have 
been straightforward to compute sin π/48 by halving π/6 successively three times 
and using the theorem of the diagonal to relate the chord of an arc to the chord 
of half that arc, a relation almost certainly known to Aryabhata; YB explains the 
geometry and gives the formula at the start of chapter 7, sections 7.2.1 and 7.2.2 of 
[Sarma]. Indeed YB concludes this part by stating that the sines of some of the 24 
Aryabhatan angles can be obtained in this way while “some others” (effectively all 
the rest) can be found by starting with sin π/4 and similarly subdividing. The whole 
table could thus have been made by direct computation using a formula for sin θ/2 
that Aryabhata knew. Given this fact, it is even more pertinent to wonder why his 
table as well as his recipé for making it deals only with sine differences, not the 
sines themselves. The germ of a plausible answer may lie in a second observation 
of Playfair's, that π/48 is the largest angle in this sequence for which the chord is 
equal to the arc to a precision of one minute. 
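
Both observations attributed to Playfair can be checked in a few lines. The sketch below uses the modern half-angle formula $\sin(\theta/2) = \sqrt{(1 - \cos\theta)/2}$ as a stand-in for the chord relation described in the text. That equivalence is an assumption of this illustration, and the function names are hypothetical. Lengths are converted to minutes of arc with the radius R = 21600/2π.

```python
import math

R = 21600 / (2 * math.pi)   # radius in minutes of arc, about 3438

def half_angle_sine(s):
    """sin(theta/2) computed from s = sin(theta), for theta in the first quadrant."""
    c = math.sqrt(1.0 - s * s)            # cos(theta)
    return math.sqrt((1.0 - c) / 2.0)

def sine_by_halving(s, times):
    """Apply the half-angle step repeatedly, starting from s = sin(theta)."""
    for _ in range(times):
        s = half_angle_sine(s)
    return s

if __name__ == "__main__":
    s48 = sine_by_halving(0.5, 3)         # pi/6 -> pi/12 -> pi/24 -> pi/48
    s24 = sine_by_halving(0.5, 2)
    print("pi/48: arc 225, half-chord", R * s48)
    print("pi/24: arc 450, half-chord", R * s24)
```

At π/48 the half-chord rounds to the same 225 minutes as the arc, while at π/24 the two already differ by more than a minute.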

A table of sines is a staple of Indian astronomical texts, right until the time of 
YB and beyond, as it was an essential aid to the computation of the true positions of 
heavenly bodies. In fact, by around the 9th century, the table acquired a canonical 
katapaya form and, at least in Kerala, seems to have been just transcribed from 
text to text. From very early times, what is most often tabulated are the sines, 
not their differences (presumably since those are what is directly employed in 
planetary calculations) but the formula itself, when it is stated, is always for the 
sine differences. Already in Sūryasiddhānta, we find the table presented (in stanzas 
II.17 - 22 in the version translated and edited by Burgess [27]) directly for the sines 
while the formula (in II.16) is for their differences: 

$$s_m - s_{m-1} = s_1 - \frac{1}{s_1}\left(s_1 + \cdots + s_{m-1}\right).$$


In all essentials this is the same as the more ambiguously expressed formula of 
AB as interpreted by K.S. Shukla in his annotated translation [28]. Shukla in fact 
gives several equivalent rewritings of the formula, due to various traditional 
commentators as they came to grips, each in his own way, with Ganita 12. All of them 
employ Aryabhatan units, namely minutes, for both an angle θ (equivalently the 
arc subtending that angle) and its sine (equivalently the half-chord of arc 2θ of 
the standard circle of circumference 21600 minutes and radius 21600/2π = 3438 


⁴⁴ In his commentary on these stanzas Burgess expresses the curious view that the table was prepared 
not by using the formula but by direct subdivision, the formula being obtained empirically by a subsequent 
inspection of the table. Why would anyone first state a rule that was not going to be used and then use a 
different rule, unstated, to get the same results? 

minutes).⁴⁵ In these units and to an accuracy of 1 minute, $\delta s_1 = \sin\varepsilon = \varepsilon = 225$, 
$\delta s_2 = \sin 2\varepsilon - \sin\varepsilon = 224$ and so $\delta^2 s_1 = -1$. Consequently, a multiplicative $\delta^2 s_1$ 
disappears from all formulae. All of the early commentators except one incorporate 
the resulting numerical simplification (a minor mischief which is easily set right by 
restoring the exact formula for $\delta^2 s_1$) as well as the approximation sin ε = ε in their 
versions of the sine-difference formula. 

The exception is Nilakanthan. Reading more into Ganita 12 than is literally 
there, he interprets the verse in a form free from conventions and approximations 
as the exact equation⁴⁶ 

$$\delta s_{m+1} = \delta s_m - \frac{s_m}{s_1}\left(\delta s_1 - \delta s_2\right)$$

which, for later reference, I rewrite in the explicit second-difference form 

$$\delta^2 s_m = \frac{s_m}{s_1}\,\delta^2 s_1.$$

Nilakanthan has himself given a geometric proof of his formula which can be 
found, in modern notation, in Shukla [28] along with an equivalent trigonometric 
formulation. I recall that proof here after highlighting what appears to have been, at 
least by the time of the Kerala school but possibly from much earlier, the ‘conducting 
thread’ leading to the consideration of differences and second differences of sines. 


⁴⁵ A tribute is due here to one of the less celebrated innovations of Aryabhata, the recourse to a 
natural, angular unit to measure chords in terms of the fixed angular ‘size’ of the circumference. This 
is the converse of the current practice of measuring angles in radians and defining the sine as the 
corresponding half-chord in a circle of unit radius and is strictly equivalent to it; in particular the use of 
modern sines etc. in the present work in place of the traditional Rsines etc. is in no respect misleading. It 
is in measuring chords in minutes that the value of π comes into play just as we need π to calculate the 
circumference of the ‘unit circle’. 

⁴⁶ It is not difficult to see in this exact generalisation of the original approximate rule a parallel with the 
other instance in which Nilakanthan has read Aryabhata's mind, when he takes the one word āsanna to 
mean that π is irrational. Surprisingly, neither of these instances of Nilakanthan's insight finds a mention 
in YB, surprising in the first case because it is uncharacteristic of YB not to state a general rule when it is 
available and, in the other, in view of the thoroughness of YB's treatment of the complex of issues relating 
to the π series. Besides, how can it be that the author of YB, a work composed to provide an exposition of 
the new mathematics “according to TS” as announced in its very first sentence, is unaware or negligent 
of such illuminating wisdom in the teaching of his own direct preceptor? A possible answer may lie in 
the fact that both these examples of mind-reading occur in Aryabhatiyabhasya which Nilakanthan wrote 
late in life and in the possibility that this work was composed after YB. Such a resolution is of some 
chronological interest. K.V. Sarma’s earliest date for YB is around 1530, by which time Nilakanthan 
would have reached the ripe old age of 85 or so. Knowing that he wrote at least two other works 
after his commentary on Aryabhata (Sarma’s introduction to [1]), one of them (the vyakhya of his own 
Siddhāntadarpana) of major epistemological importance, it is not unreasonable to assign to it a date 
well before 1530. Correspondingly, if this work postdates YB as suggested here, 1530 will be more like 
a strong upper limit date for YB. 
That thread starts with the realisation of the inadequacy of the rule of three 
(trairasikam) when dealing with quantities whose mutual dependence is not linear, 
in the present instance the ‘sides’ of ‘similar triangles’ when one ‘side’ is an arc 
rather than a chord of the circle. Just before the description of the computation of 
the sine table (at the end of section 7.4.1 of [Sarma]), YB has a brief but enlightening 
passage on the error of relying on the rule of three to determine chords from arcs. 
After warning that it will lead to gross (sthūlam) results, it ends with an explanation 
and an exhortation: 


The reason for this [grossness is as follows]. The second arc ($2\epsilon$ in our notation) 
[is] twofold the initial arc, the third arc threefold. Thus the arcs. The second chord 
is not twofold the initial chord, the third chord is not threefold, and so on. The 
reason for this [is as follows]. The initial arc has no curvature [and is] almost equal 
to the chord since the saram (the versine, $1 - \cos\epsilon$; see figure 7.2 of [RSS] for the 
terminology) is small. . . . So do not do trairasikam with respect to the arc because 
the result will be gross. 


In effect we are told that $\sin\theta$ is not a linear function of $\theta$ since the relation between 
the arc and the chord is not one of proportionality. In particular, $s_m$ is not the mean of 
$s_{m+1}$ and $s_{m-1}$ and the difference $\delta s_m$ is not constant. The grossness in the value of 
$s_m$ that will result from doing trairasikam is thus measured by the difference between 
$s_{m+1} + s_{m-1}$ and $2s_m$ or equivalently between $\delta s_{m+1}$ and $\delta s_m$, in other words the 
second difference. The line of thinking here is evidently very similar to what we 
have already met in other varieties of samskaram: the gross first guess is that the 
sine can be linearly interpolated, i.e., that the difference $\delta s_m$ is independent of m, 
and the next step is to evaluate the deviation from linearity as the second difference. 

Nilakanthan does this geometrically by considering two astutely chosen pairs of 
similar right triangles (Shukla's trigonometric proof uses the addition formula for 
$\sin(m+1)\epsilon$ and $\sin(m-1)\epsilon$) leading to 

$$\delta^2 s_m = 2 s_m(\cos\epsilon - 1).$$


Nothing is more natural now than to eliminate $\cos\epsilon - 1$ in favour of $\delta^2 s_1$ and to 
arrive at the formula written down earlier; Nilakanthan's equation is thus exact. If 
the initial values $s_1$ and $s_2$ (and hence $\delta^2 s_1 = s_2 - 2s_1$) are also exactly known, the 
solution will give the exact values of all $\delta^2 s_m$, all $\delta s_m$ and all $s_m$ recursively. Such an 
exact solution will satisfy all the symmetries of the sine function if the computation 
is carried beyond the first quadrant: s3, = 1 — sj, etc. and be periodic: s97 = s4, etc. 
Here then is the quintessential illustration of what I have called cyclic recursion in 
section 1.7. 
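
The cyclic character of this recursion can be checked numerically. The Python sketch below is an illustration added here, not something found in the sources: the function name `sine_table_by_second_differences` is hypothetical, and the two seed values are simply taken from floating-point `math.sin`. It generates the table from the first two sines alone, using the second difference at m = 1 scaled by the ratio of each sine to the first, and continues through a full circle of 96 steps of π/48.

```python
import math

def sine_table_by_second_differences(s1, s2, count):
    """Return [s_0, ..., s_count] from s_0 = 0, s_1, s_2 and the rule
    (second difference at m) = (s_m / s_1) * (second difference at 1)."""
    d2_1 = s2 - 2.0 * s1                 # second difference at m = 1, since s_0 = 0
    table = [0.0, s1, s2]
    while len(table) <= count:
        s_prev, s_m = table[-2], table[-1]
        d2_m = (s_m / s1) * d2_1         # the exact second-difference equation
        table.append(2.0 * s_m - s_prev + d2_m)
    return table[:count + 1]

eps = math.pi / 48                       # 24 steps per quadrant, 96 per circle
table = sine_table_by_second_differences(math.sin(eps), math.sin(2 * eps), 97)
max_err = max(abs(table[m] - math.sin(m * eps)) for m in range(98))
print(max_err, table[97] - table[1])
```

Under these assumptions the computed values agree with the sine to rounding error, and the 97th entry returns to the first, which is the periodicity referred to above.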

Thus Nilakanthan. As for Aryabhata, while he may well have been led to second 
differences by the failure of the rule of three, it appears doubtful that he arrived at 
his formula along this precise path. Apart from the evidence of the actual words 
of Ganita 12, the construction invoked by Nilakanthan was probably unknown 
to Aryabhata, being (along with several variants) a characteristic of the Kerala 
approach to circle geometry.⁴⁷ Every piece of writing bearing on this question 
points to Aryabhata taking recourse, from the beginning, to the approximation 
$s_1 = \epsilon$ along with the known formula for doubling or halving the angle to evaluate 
$\delta^2 s_1$ as 

$$\delta^2 s_1 = -2 s_1 \cdot 2\sin^2\frac{\epsilon}{2} = -s_1 \epsilon^2$$


and proceeding thence recursively. More generally, Nilakanthan's equation will 
take the corresponding approximate form 

$$\delta^2 s_m = -s_m \epsilon^2.$$


There is no need to underline the enormous significance of this equation. As $\epsilon$ is 
further subdivided into arbitrarily (yathestam) small pieces (“atomised”), the 
approximation will tend to exactness, the difference equation to the correct differential 
equation for the sine and the two initial values $\sin 0 = 0$ and $\sin\epsilon = \epsilon$ to the 
appropriate initial conditions for its solution. Solving the equation also transforms into a 
subtle iterative process, yet another productive variant of the method of samskāram. 
This is one of the ways in which the sine and the cosine series were finally nailed 
down by Madhavan. YB's own treatment of the table is meticulously adapted from 
the start to a general division of the quadrant and to the eventual goal of making 
each arc segment infinitesimally small as we will see in section 4.4. 
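
The claim that the approximate recursion tends to exactness under subdivision can be illustrated with a short Python sketch. It is a modern numerical check added here, with a hypothetical function name `approximate_sine`; it assumes only the approximate second-difference rule (second difference of the sine equal to minus the sine times the square of the step) together with the initial values 0 and the step itself.

```python
import math

def approximate_sine(theta, steps):
    """Step s_{m+1} = 2*s_m - s_{m-1} - eps**2 * s_m from s_0 = 0, s_1 = eps,
    with eps = theta / steps, and return s_steps as an estimate of sin(theta)."""
    eps = theta / steps
    s_prev, s_m = 0.0, eps
    for _ in range(steps - 1):
        s_prev, s_m = s_m, 2.0 * s_m - s_prev - eps * eps * s_m
    return s_m

theta = math.pi / 3
# Each refinement divides the same arc into ten times as many pieces.
errors = [abs(approximate_sine(theta, n) - math.sin(theta)) for n in (24, 240, 2400)]
print(errors)
```

The errors shrink as the arc is cut into more pieces, which is the sense in which the difference equation goes over into the differential equation for the sine.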


4.3. The Interpolation Formula - Simultaneous Samskaram 


The description of the sine table is immediately followed in YB by a section outlining 
how to use the tabulated values to estimate $\sin\theta$ and $\cos\theta$ when $\theta$ lies "in the gap" 
between two consecutive tabulated angles. There are hints that this topic may have 
been considered a distraction: it is brief to the point, almost, of impatience; the 
whole passage is two paragraphs and less than one printed page ([Sarma], section 
7.4.3) long. The operative paragraph sketches a geometrical construction in terms 
of the inevitable pair of similar right triangles and indicates (barely) what to do with 
them in order to obtain a second order interpolation formula. The reader will find 
the geometry in Fig. 7.6 of [Sarma] and an explanation of the derivation in [RSS]. 


47. Shukla’s use of the addition formula is of course unnecessary and may be misleading. While 
the sine difference formula follows directly from the addition formula, the reverse implication is less 
direct. In any case, Aryabhata almost certainly was not in possession of the general addition formula. 
The distinction of its discovery also belongs to Madhavan, going by […] references to the 
jiveparaspara-nyayam (literally, the principle of adjacent chords; more informatively, the principle of 
[…] of adjacent arcs) and its heavy use by his followers (including in YB). 
For an incremental angle $\delta := \theta - m\epsilon < \epsilon$, the formulae are 

$$\sin(m\epsilon + \delta) = \sin m\epsilon + \delta\cos m\epsilon - \tfrac{1}{2}\delta^2 \sin m\epsilon,$$

$$\cos(m\epsilon + \delta) = \cos m\epsilon - \delta\sin m\epsilon - \tfrac{1}{2}\delta^2 \cos m\epsilon$$


and they depend essentially on the Aryabhatan approximation of $\sin\delta/2$ by $\delta/2$. 
The section ends with “If this is not of sufficient accuracy, take the full chord of one 
quarter of the residual arc (i.e., $\delta/4$ in place of $\delta/2$) . . . if this is also not sufficient, 
take half of that . . . ” followed by a reference to a verse in TS. The paragraph does 
include the word samskaram but that is about all. 
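
To get a feeling for the accuracy of the second order formula, here is a minimal Python sketch. It is an illustration added here under stated assumptions: the function name `interpolate` is hypothetical, the tabular step is taken as π/48 (24 arcs to the quadrant), and `math.sin` and `math.cos` stand in for look-ups in the table. It does not attempt the further halving refinement quoted above.

```python
import math

EPS = math.pi / 48                       # tabular step: 24 arcs per quadrant

def interpolate(theta):
    """Second order estimate of (sin theta, cos theta) from values at m*EPS."""
    m = int(theta // EPS)                # tabulated angle just below theta
    d = theta - m * EPS                  # residual arc, 0 <= d < EPS
    s, c = math.sin(m * EPS), math.cos(m * EPS)   # stand-ins for table look-ups
    sin_est = s + d * c - 0.5 * d * d * s
    cos_est = c - d * s - 0.5 * d * d * c
    return sin_est, cos_est

theta = 0.7
sin_est, cos_est = interpolate(theta)
sin_err = abs(sin_est - math.sin(theta))
cos_err = abs(cos_est - math.cos(theta))
print(sin_err, cos_err)
```

Since the neglected terms are of third order in the residual arc, the error stays below about (π/48)³/6 for any angle in the table's range.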

In fact the reasoning behind the interpolation is interestingly recursive. The 
sentence cited above asks us to take the computed approximate value of 
$\sin(m\epsilon + \delta/2)$ as the new input and, proceeding as before, to calculate the new output $\sin\theta$ as 
$\sin((m\epsilon + \delta/2) + \delta/2)$; obviously the process can be continued indefinitely. But it is to 
be stressed that while there is an infinite samskaram at work here, based on a nested 
sequence of segments of arc of length $\delta/2^i$ in geometric progression, the segments 
are not taken to zero at any stage: there is nothing infinitesimal, no calculus. 
Perhaps this accounts for the inhabitual abruptness with which YB terminates the 
discussion of the interpolation. To follow this particular thread, therefore, we have 
to rely on sources other than YB. 

It is useful to start by recognising that the only point about making the expansion 
around the Aryabhatan angles $\theta = m\epsilon$ is that their sines and cosines are supposed 
known from the table; the geometry itself works for any value of $\theta$. Similarly, though 
$\theta + \delta$ is taken to be between $m\epsilon$ and $(m+1)\epsilon$, the geometry works for any angle $\delta$. With 
this in mind we turn to Kim Plofker's careful analysis ([29, 30]) of the far more 
extensive account of this topic by Jyesthadevan's younger contemporary Sankaran 
in his commentaries on TS. As Plofker has underlined, the formulae are established 
for both the sine and the cosine simultaneously by setting up a pair of coupled 
equations for the correction: the output of alternating steps in the refining of the 
correction, say the ith for $\sin(\theta + \delta)$, involves $\cos(\theta + \delta/2^{i-1})$ whose evaluation 
involves $\sin(\theta + \delta/2^{i})$ and so on. The expressions resulting from iterating the steps 
ad infinitum (Sankaran does the first seven of them) are strongly reminiscent of the 
nested brackets occurring in the treatment of the geometric series by means of the 
elementary recursive identity but more spectacular ([30]): 


. ô 
sin?) = sing +3 (2cos0— 5 (sing 4 2 cos@  Ó (sin 
2 2^ RE is ))): 


ó F ô 3 
cos(8--9) = cosd—— 2sin@ + 3 (cosa — â (= de ô cos @ 
à z 2 ET ss -)))) 3 
Removing the brackets turns them into infinite series in powers of $\delta$ (equations 
28 of [30], extended by repeated substitutions, in a slightly different notation), the 
numerical coefficients being, naturally, increasing inverse powers of 2. 

To our eyes these are strange series. The coefficients of successive powers of $\delta$ 
involve successive derivatives of the sine and the cosine linearly as in the Taylor 
series and the first two terms are exactly the same as in the Taylor series; but Taylor 
series they are not. We can begin to guess the precise source of the error (see the 
next-but-one paragraph) but that the series are wrong is made absolutely clear by 
the fact that they specialise at $\theta = 0$ to (equations 29 of [30]) 


oo Loe 
maoe aaa 
Zn Bt 425 


o de 
cos6=1-2(5- 12e). 


which are together inconsistent with the fact that sine and cosine are each other's 
differentials up to a sign, something certainly known to Madhavan and much used 
in YB. And who could have failed to notice that they are inconsistent also with 
Madhavan's own sine/cosine series? 

Not only is the space devoted in YB to the subject of interpolation dismissively 
short, there are also no statements in that short space to the effect that an exact 
expansion for $\sin(\theta + \delta)$ will result if the recursive construction is continued forever 
(no “atomisation”, no division by pararddham), statements of the sort found quite 
freely in connection with the π series (sections 6.3 and 6.4 of [Sarma] cited earlier) 
and the sine/cosine series (e.g., sections 7.5.2, 7.5.4 and 7.5.5). Is one to conclude 
then that for Jyesthadevan, unlike perhaps for Sankaran, the formulae were no 
more than a reliable numerical approximation, a means of extending the utility of 
the sine table, with no deeper significance? If the answer is yes, that will be yet 
another piece of evidence for YB's infallible instinct for what was genuinely deep 
and original in Madhavan's accomplishments, but there may be a more prosaic 
explanation. The recursive element in this small section, in particular the idea 
of simultaneous samskaram, comes into its own when used in conjunction with 
Madhavan's infinitesimal philosophy to generate the sine/cosine series. In that 
context it certainly gets its due attention, in the sections of YB that follow. 

The point to be underlined at the end is that doing a sequence of iterative 
corrections of the type seen above is not by itself calculus, any more than the 
geometric series obtained in the same way or the general binomial series is calculus, 
irrespective of whether the interpolation series coincide with the Taylor series or 
not and, even, whether they are correct or not. The essence of the Kerala approach to 
calculus was the same as it was to be in Europe and easily described in the language 
of its modern version: divide the range of the independent variable (the tangent to an 
arc in the arctangent series and the arc in the sine series, next subsection) into n equal 
parts with n eventually to be made to tend to infinity, linearise the dependent variable 
(the arc and the half-chord respectively in the two problems) as a function of the 
independent variable at each point of division and add up the linearised quantities 
ensuring that the remainders so omitted tend to zero in the limit, thereby implicitly 
establishing/employing the fundamental theorem. The interpolation formulae are 
outside that framework. The ‘independent variable’ $\delta$ is subjected to an iterated, 
nested subdivision as in the case of the geometric series (hence the similarly nested 
final expression) and as a result the segments of $\delta$ do not tend to zero in the limit 
(they are not differentials); one only has an infinite sequence of segments whose 
limit is zero. There is a process of linearisation, but the ‘error’ involved in that 
process is not controllable in the limit. At the fundamental level, therefore, there 
is no conceptual linkage between the two. These shortcomings could be overcome, 
and were, by Madhavan himself in the course of developing the sine/cosine series 
around $\theta = 0$, but then it becomes calculus. He could also have rewritten the series, 
but did not, as expansions around any other point, true Taylor series [16], by noting 
that the length of an arc did not depend on its initial point being the direction east, 
that it was invariant under translations of the circle. 

It is in this light that the controversy that erupted a few years ago around the 
question of whether calculus was invented in India (the debate can be followed 
starting with [30] and the references therein) is to be viewed. The misinterpretation 
was to some extent inevitable: it owed its origin to a reading of the wrong material 
(the interpolation formulae) from the wrong sources (Sankara Variyar's elaborations 
on TS) while the right source remained, untranslated and largely unstudied, hidden 
in a language inaccessible except to a few. Thanks finally to the long overdue 
translation of YB by K.V. Sarma, aided in no small measure by the pioneering work 
of Tampuran and Ayyar to all three of whom this article is a tribute, that reason is 
no longer there. 


4.4. From Differences to Differentials 


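The development of the sine and cosine series 

$$\sin\theta = \theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} - \cdots$$

and 

$$\cos\theta = 1 - \frac{\theta^2}{2!} + \frac{\theta^4}{4!} - \cdots$$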


borrows recursive elements both from the construction of the sine table (working 
with sine differences) and from the interpolation formula (the method of 
simultaneous samskaram). The critical input however is the novel concept already put to 
use in the theory of the arctangent series, the infinitesimal geometry of Madhavan. 
Though the conceptual and the technical go hand in hand, even more inseparably 
in this part of YB than elsewhere, my emphasis will again be on the recursive 
techniques employed; the originality of the calculus component of the arguments 
will figure only incidentally, mostly in this subsection and in section 4.7. So the 
description here of the geometry is minimal (greater detail can be found in [RSS]), 
only to the extent required to make the working out of the steps comprehensible and 
to highlight the specific steps that bear witness to the birth of calculus. 

Thus we begin by considering a circle of unit radius and an arc of it subtending 
an angle $\theta$ at the centre. The arc is divided into a number of equal segments and 
the objects of study are the sines and cosines of the angles defined by the points 
of division, geometrically the half-chords through these points and their distances 
from the centre, as well as the corresponding quantities at the midpoints of the 
segments. Nothing is lost and some extra effort in writing saved if we divide $\theta$ into 
an even number $2n$ of segments of length $\delta := \theta/2n$ from the start; so the quantities 
of interest are $s_i := \sin i\delta$ and $c_i := \cos i\delta$, $i = 1, 2, \ldots, 2n$. 

The reason for considering the midpoints of the arc segments simultaneously with 
their endpoints becomes apparent quickly. Applying trairasikam to two suitable pairs 
of similar triangles, YB shows that the difference in the sines (cosines) of two angles 
differing by two segments is proportional to the cosine (sine) of the intervening 
angle: 

$$\delta s_i := s_{i+1} - s_{i-1} = 2 s_1 c_i,$$

$$\delta c_i := c_{i+1} - c_{i-1} = -2 s_1 s_i.$$


These exact, elementary and beautiful relations are thus the finite difference 
analogues of the differentials of the sine and the cosine. They are also true in the more 
general form 

$$\sin(\phi + \chi) - \sin(\phi - \chi) = 2\sin\chi\cos\phi$$

for any $\phi$ and $\chi$ as a quick glance at the geometry makes clear. Though no different 
in method or conclusion from Nilakanthan's treatment of the tabulated sines (with 
the arc segment fixed at π/48), their general validity for an arbitrary division is now 
acknowledged and underlined for the first time: “Here the arc segment has to be 
imagined to be as small as one wants . . . [but] since one has to explain in a certain 
[definite] way, [I] have said [so far] that a quadrant has twentyfour chords” ([Sarma] 
7.5.2). Other issues addressed by this simple construction/observation include the 
quantification of the deviation from linearity (the inapplicability of the rule of three) 
and the fact that while the difference formulae follow from the addition formulae 
the former can be established more directly and simply. 

The second differences are defined in the same way: $\delta^2 s_i := \delta s_{i+1} - \delta s_{i-1}$ 
and similarly for the cosines. Because the sine and cosine are each other's first 
differences, the second differences can be calculated without any further geometry: 

$$\delta^2 s_i = 2 s_1\,\delta c_i = -4 s_1^2\, s_i,$$

$$\delta^2 c_i = -2 s_1\,\delta s_i = -4 s_1^2\, c_i.$$


Consequently the content of the second difference equation is identical to that 
of the pair of coupled first difference equations. YB deals with both sets, often in 
conjunction, in its search for the solution. The slight lack of focus that results (the 
treatment on the whole is less streamlined than that of the arctangent series) 
leaves one with the impression that this first record of the detailed understanding 
of what was after all a revolutionary advance was not completed without struggle. 
In any case, the critical fact that these equations (especially the second difference 
equation) which were originally derived for $2\delta = \pi/48$ (and for that value of $\delta$ 
are the same as Nilakanthan's starting point for the sine table) remain valid for any 
value of $\delta$ gets its due attention as attested in the quotation above. 
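
That the first and second difference relations hold exactly for an arbitrary division, and not only for the tabular one, is easy to confirm numerically. The Python sketch below is an added check with freely chosen values for the arc and the number of segments; the helper names `d1` and `d2` are hypothetical. Differences are taken symmetrically across two segments, as in the text.

```python
import math

theta, n = 1.2, 6                        # arbitrary arc, divided into 2n segments
delta = theta / (2 * n)
s = [math.sin(i * delta) for i in range(2 * n + 2)]
c = [math.cos(i * delta) for i in range(2 * n + 2)]

def d1(f, i):
    """Symmetric first difference across two segments: f[i+1] - f[i-1]."""
    return f[i + 1] - f[i - 1]

def d2(f, i):
    """Second difference: first difference at i+1 minus first difference at i-1."""
    return d1(f, i + 1) - d1(f, i - 1)

alpha = 4 * s[1] ** 2                    # the factor 4 * sin(delta)**2
first_ok = all(
    math.isclose(d1(s, i), 2 * s[1] * c[i], abs_tol=1e-12)
    and math.isclose(d1(c, i), -2 * s[1] * s[i], abs_tol=1e-12)
    for i in range(1, 2 * n + 1)
)
second_ok = all(
    math.isclose(d2(s, i), -alpha * s[i], abs_tol=1e-12)
    and math.isclose(d2(c, i), -alpha * c[i], abs_tol=1e-12)
    for i in range(2, 2 * n)
)
print(first_ok, second_ok)
```

Changing `theta` or `n` leaves both checks intact, which is the point made in the quotation about imagining the arc segment as small as one wants.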

At this point it is convenient to break with the order in which YB broaches the 
questions that arise in finding the exact solutions in the limit and note the obvious 
fact that the equations themselves are well-defined in the limit as the familiar 
differential equations satisfied by the sine and the cosine. As has been said several 
times already, YB's way is to first solve them recursively and then to take the limit 
of the solution. That will be described in the following two subsections. For the 
present, let us just note that in the limit of large n, the angular segment $\delta = \theta/2n$ 
tends to 0, the chord length $2s_1 = 2\sin\delta$ tends to $2\delta$, $i\delta$ becomes some general angle 
$\phi < \theta$ and the difference equations become the differential equations 

$$\frac{ds(\phi)}{d\phi} = c(\phi), \qquad \frac{dc(\phi)}{d\phi} = -s(\phi)$$

and 

$$\frac{d^2 f(\phi)}{d\phi^2} = -f(\phi)$$

respectively for f = s or c, defining $df(\phi)/d\phi := \lim_{\delta\to 0}\,(f(\phi+\delta) - f(\phi-\delta))/2\delta$. 
These equations cannot be solved by simply integrating their right sides unlike in 
the case of the arctangent series where the problem was reduced geometrically 
to a quadrature, namely solving the first order differential equation 
$d\theta(t)/dt = 1/(1 + t^2)$, and then further reduced to evaluating integrals of powers by means 
of the recursively generated geometric series for the inhomogeneous […] 
solution of the corresponding difference equations will face the […] 
4.5. Recursive ‘Integration’ of a Difference Equation 


To repeat, the difference equations for the sine and the cosine are homogeneous 
equations, involving the same unknown quantities on the right as are sought and so 
no direct evaluation or quadrature is possible. It is the strategy devised to get around 
this difficulty that makes the final result, the sine/cosine series, the most impressive 
achievement of YB's calculus. Apart from the calculus itself, the strategy involves 
several recursive novelties of which the most fundamental is a process of samskaram 
which kicks off with the first approximation of replacing the unknown chords by 
the corresponding (known) arcs (i.e., $\sin\phi$ by $\phi$) and repeatedly refining it. 

As a preliminary to the samskaram, YB makes the elementary observation that, 
in general terms, for a function f(x) and a division $(x_0, x_1, \ldots, x_n)$ of the domain 
of x, 

$$f(x_n) - f(x_0) = \delta f(x_n) + \delta f(x_{n-1}) + \cdots + \delta f(x_1),$$

with $\delta f(x_i) := f(x_i) - f(x_{i-1})$: the whole is the sum of its parts (the twitsoip 
principle? the fundamental theorem of pre-calculus?). For the sine function, this 
allows us to write 

$$s_{2n} = \delta s_{2n-1} + \delta s_{2n-3} + \cdots + \delta s_1,$$

taking account of our symmetric definition of $\delta$ and the initial value $s_0 = 0$. The 
difference equation for s can now be used to rewrite this as 

$$s_{2n} = 2 s_1 (c_{2n-1} + c_{2n-3} + \cdots + c_1) = 2 s_1 (c_{2n-1} + \cdots + c_3) + s_2.$$


[…] is also the sum of its parts, […] 

$$s_{2n} = -4 s_1^2 \big(s_{2n-2} + 2 s_{2n-4} + \cdots + (n-1)\, s_2\big) + n s_2.$$
[…] same twitsoip principle twice, to the second and first differences successively, to 
get to the same representation as above of $s_{2n}$ (and $c_{2n-1}$). It seems appropriate to 
call it the samkalitam (a word used to mean both the finite sum and its limit as an 
integral in YB) representation. 

What does this representation represent? Nothing more or less than the 
finite-difference counterpart of the formal integration of the relevant differential equation. 
For instance, in the limit $\delta \to 0$ of the representation 

$$s_{2n} = 2 s_1 \sum_{j=1}^{n} c_{2j-1},$$

$2s_1$ tends to $d\phi$ and the sum becomes the integral: 

$$s(\theta) = \int_0^{\theta} c(\phi)\, d\phi.$$

Similarly the representation of $s_{2n}$ as a sum of $s_{2j}$ becomes 

$$s(\theta) = \theta - \int_0^{\theta} d\phi \int_0^{\phi} d\phi'\, s(\phi').$$


In other words, modulo the imposition of initial conditions, what is in play here 
is what becomes in the limit the technique of turning differential equations into 
integral equations by an appeal, usually tacit, to the fundamental theorem. 
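
The passage from the finite sum to the integral can be made concrete with a small Python sketch. It is an added illustration with hypothetical function names: `samkalitam_sine` evaluates the exact finite representation (twice the first sine times the sum of the cosines at the odd points of division), while `midpoint_sum` replaces the chord factor by the arc, which turns the sum into an ordinary midpoint Riemann sum for the integral of the cosine.

```python
import math

def samkalitam_sine(theta, n):
    """2*sin(delta) * (cos(delta) + cos(3*delta) + ... + cos((2n-1)*delta)),
    with delta = theta / (2n); this equals sin(theta) for every n."""
    delta = theta / (2 * n)
    odd_cosines = sum(math.cos((2 * j - 1) * delta) for j in range(1, n + 1))
    return 2 * math.sin(delta) * odd_cosines

def midpoint_sum(theta, n):
    """Same sum with the chord 2*sin(delta) replaced by the arc 2*delta."""
    delta = theta / (2 * n)
    odd_cosines = sum(math.cos((2 * j - 1) * delta) for j in range(1, n + 1))
    return 2 * delta * odd_cosines

theta = 1.0
exact_gap = abs(samkalitam_sine(theta, 5) - math.sin(theta))
riemann_gaps = [abs(midpoint_sum(theta, n) - math.sin(theta)) for n in (5, 50, 500)]
print(exact_gap, riemann_gaps)
```

The first quantity vanishes to rounding error for any n, while the second only approaches the sine as n grows; the difference between the two is exactly the replacement of chord by arc that becomes harmless in the limit.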

The samkalitam representation continues to hold when n is replaced by any 
$i < n$, since $\theta$ is an arbitrary angle. For all $i > 1$ it is a nontrivial expression of 
the content of the difference equations (for $i = 1$ it is empty: $s_2 = s_2$). But it is 
an […] result; it does not solve the problem of finding $\sin\theta$ as an […] 
arc (istacapam). As [we] do not know any of the […] 
the arc itself and do the samkalitam of the […] 
is to replace $s_{2k}$ by $s_{2k}^{(0)} := 2\delta k$ for all k […] 
from the […] 
refining, it seems bes[t] […] 
with 

$$S_k(n) := \sum_{j=1}^{n} S_{k-1}(j).$$

In particular $S_k(1) = 1$ for all k. They have simple algebraic expressions which 
have an interesting history and to which we will pay some attention soon but, for 
the present, it is enough to work with the definitions. 

The samskāram is carried out as follows. In the first step set $s_{2i}$ as $s_{2i}^{(0)} = 2i\delta$ for 
all $i < n$ but treat the factor $4s_1^2 =: \alpha$ as a known parameter not subject to correction 
since there is no equation to determine it. With this input, the first approximation 
for $s_{2n}$ is a sum-of-sums: 

$$s_{2n}^{(1)} = 2n\delta - 2\alpha\delta\,\big[((n-1) + \cdots + 1) + ((n-2) + \cdots + 1) + \cdots + 1\big] = 2n\delta - 2\alpha\delta\,\big(S_1(n-1) + \cdots + S_1(1)\big),$$

and hence simplifies to 

$$s_{2n}^{(1)} = 2\delta\,\big(n - \alpha S_2(n-1)\big).$$

In the next step, the input values $s_{2i}^{(1)}$ for $i = 2, \ldots, n-1$ ($s_2$ does not change 
since there is no equation to determine it) are chosen as the output of updating the […] 

$$s_{2i}^{(1)} = 2\delta\,\big(i - \alpha S_2(i-1)\big).$$
[…] $s_2$, because the error in the first guess of replacing chords by arcs will propagate 
through the iterations. One can undo the error by choosing $s_{2i} = i s_2$ (i.e., i times 
the first chord segment rather than the arc) as the first guess and carry out further 
refinings as before, resulting in 

$$\sin\theta = \sin\frac{\theta}{n}\left(n - 4\sin^2\frac{\theta}{2n}\,S_2(n-1) + 4^2\sin^4\frac{\theta}{2n}\,S_4(n-2) - \cdots\right).$$

For n = 1 we get the tautology $\sin\theta = \sin\theta$ as expected, for n = 2 the Aryabhatan 
relation $\sin\theta = 2\sin(\theta/2)(1 - 2\sin^2\theta/4)$, etc. In fact the formula is nothing but 
the generalisation of Nilakanthan's exact solution for the sine table to arbitrary 
divisions of an arbitrary angle. 
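
The exactness of the finite series can be tested directly. The Python sketch below is an added check, not a transcription of YB's procedure. It assumes that the repeated sums satisfy S_k(n) = S_k(n-1) + S_{k-1}(n) with S_k(1) = 1, and it adopts the convention S_0(n) = n (an assumption made here so that the leading term n fits the same pattern as the later terms). The function names `S` and `finite_sine_series` are hypothetical.

```python
import math
from functools import lru_cache

@lru_cache(maxsize=None)
def S(k, n):
    """Repeated sum: S(0, n) = n (convention assumed here),
    S(k, n) = S(k-1, 1) + ... + S(k-1, n), and 0 when n <= 0."""
    if n <= 0:
        return 0
    if k == 0:
        return n
    return S(k, n - 1) + S(k - 1, n)

def finite_sine_series(theta, n):
    """sin(theta/n) * (n - a*S(2, n-1) + a**2 * S(4, n-2) - ...),
    with a = 4 * sin(theta / (2n))**2; the sum stops after n terms."""
    a = 4 * math.sin(theta / (2 * n)) ** 2
    total = sum((-a) ** m * S(2 * m, n - m) for m in range(n))
    return math.sin(theta / n) * total

theta = 1.3
gaps = [abs(finite_sine_series(theta, n) - math.sin(theta)) for n in (1, 2, 3, 7)]
print(gaps)
```

For n = 2 the sum reduces to 2 minus the factor a, which reproduces the double-angle relation quoted in the text.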

YB does not quite get to describing the finite series. It has, once again, more 
ambitious goals than a clever trigonometric identity. After setting out the samskaram, 
it straightaway takes the limit of large n (for fixed $\theta$) in all the trigonometric factors 
occurring in the series (with the usual reference to pararddham and "endless 
atomisation"): $s_2 \to \theta/n$, $\alpha \to \theta^2/n^2$, leaving the coefficients $S_{2k}(n-k)$ untouched for 
the moment. As n tends to infinity, the finite series turns into a power series whose 
dominant terms give 

$$\sin\theta \sim \theta - \frac{\theta^3}{n^3}\,S_2(n-1) + \frac{\theta^5}{n^5}\,S_4(n-2) - \cdots.$$

The asymptotic form of $S_k(n)$ is actually easy to find and already available from 
chapter 6 of YB: $S_k(n) \sim n^{k+1}/(k+1)!$, leading finally to the sine series. 
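
The two limiting statements can be looked at numerically. The Python sketch below is an added illustration; it uses the closed form of the repeated sums as a binomial coefficient, S_k(n) = C(n+k, k+1), which is the product rule for these sums written compactly, and the function names `S_closed` and `large_n_series` are hypothetical.

```python
from math import comb, factorial, sin

def S_closed(k, n):
    """n(n+1)...(n+k) / (k+1)!, written as a binomial coefficient."""
    return comb(n + k, k + 1)

# Ratio of S_4(n) to its asymptotic form n**5 / 5! for growing n.
ratios = [S_closed(4, n) * factorial(5) / n ** 5 for n in (10, 100, 1000)]

def large_n_series(theta, n, terms=6):
    """theta - (theta/n)**3 * S_2(n-1) + (theta/n)**5 * S_4(n-2) - ..."""
    total = 0.0
    for m in range(terms):
        total += (-1) ** m * theta ** (2 * m + 1) * S_closed(2 * m, n - m) / n ** (2 * m + 1)
    return total

theta = 1.0
gap = abs(large_n_series(theta, 1000) - sin(theta))
print(ratios, gap)
```

The ratios approach 1 and the series with n = 1000 already agrees with the sine to several decimal places, the remaining gap being of relative order 1/n².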

The derivation of the cosine series proceeds in parallel. Also, instead of solving 
the second-difference equation for the sine directly, we could have started from 
the two first-difference equations for the sine and the cosine, written down the 
samkalitam representations for them, made the first guesses $\sin\theta = \theta$ and $\cos\theta = 1$, 
refined them in alternation and arrived at the same final series. 

The samskaram technique can be used even more readily after taking the limit, 
to solve the resulting differential equations. As noted earlier, we can appeal to the 
fundamental theorem to formally integrate the differential equations to the integral 
form of the samkalitam representations; for instance the second order equation for 
the sine becomes the integral equation 


$$\sin\theta = \theta - \int_0^{\theta} d\phi_1 \int_0^{\phi_1} d\phi_2\, \sin\phi_2,$$

on supplying the limits and constants of integration and using the initial conditions 
$\sin 0 = 0$, $\cos 0 = 1$. The samskaram way of solving it is to substitute the first 
guess $\sin\phi_2 = \phi_2$ in the integrand on the right to produce $\sin\theta = \theta - \theta^3/3!$, then 
substitute this expression back into the integrand and so on. Alternatively and even 
more faithfully to the three samskaram steps listed in section 3.1, replace the sine 
in the integrand by its integral representation repeatedly. The result after k steps is 
the identity 

$$\sin\theta = \theta - \frac{\theta^3}{3!} + \cdots + (-1)^k \int_0^{\theta} d\phi_1 \int_0^{\phi_1} d\phi_2 \cdots \int_0^{\phi_{2k-1}} d\phi_{2k}\, \sin\phi_{2k},$$

which, in structure, resembles the elementary recursive identity first encountered 
in section 3.3 but by now has evolved into a more intricate samskaram equation. 
We know that the last (error) term vanishes asymptotically but, unlike when dealing 
with the arctangent series, YB does not talk about the convergence of the sine/cosine 
series. 
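
The substitute-and-integrate loop described here is easy to carry out mechanically. The following Python sketch is an added illustration in modern terms (what would now be called Picard iteration), with hypothetical helper names `integrate_twice` and `refine`; polynomials are stored as dictionaries from power to exact rational coefficient.

```python
from fractions import Fraction

def integrate_twice(coeffs):
    """Double integral from 0 of a polynomial given as {power: coefficient}."""
    return {p + 2: a / ((p + 1) * (p + 2)) for p, a in coeffs.items()}

def refine(coeffs):
    """One refining step: theta minus the double integral of the current guess."""
    result = {1: Fraction(1)}
    for p, a in integrate_twice(coeffs).items():
        result[p] = result.get(p, Fraction(0)) - a
    return result

guess = {1: Fraction(1)}                 # first guess: the sine replaced by the arc
for _ in range(3):
    guess = refine(guess)
print(sorted(guess.items()))
```

Each pass adds one more term of the sine series with its factorial denominator, without any term already obtained being altered.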


4.6. An Inductive Proof That Is Absent 


In some contrast to its treatment of the coefficients in the arctangent series where 
their inductive evaluation was done after their asymptotically dominant form was 
determined, YB describes exact formulae for the coefficients $S_k(n)$ for general k and 
n. It starts by giving the traditional geometric yukti for sums of natural numbers 
$S_1(n) = n(n+1)/2$ in surprising detail and then goes on to state the rule for $S_2(n)$ 
and $S_k(n)$ more generally: “Multiply together the padam (the whole arc, in my 
notation n), padam plus one and padam plus two. The result of dividing it by the 
product of one, two and three is the second samkalitam. In the same manner multiply 
the numbers which are increased by one at a time and divide it by the product of the 
same number [of factors] one, two, etc. The result is the samkalitam which is one 
below” ([Sarma] 7.5.3): 

$$S_k(n) = \frac{n(n+1)\cdots(n+k)}{1\cdot 2\cdots(k+1)} = \frac{n(n+1)\cdots(n+k)}{(k+1)!}.$$

[W]hy does YB take the trouble of stating the exact formula for $S_k(n)$ when all it 
[…] asymptotic form $S_k(n) \sim n^{k+1}/(k+1)!$ which was already 
[…] its earliest occurrence. […] 
set out in Sankaran’s Kriyakramakari, a commentary 
[…] of these sources contains a yukti for 
[…] accomplished and not 
[…] formulation […] 



Define 

$$T_k(n) := \frac{n(n+1)\cdots(n+k)}{(k+1)!}.$$

Then 

$$T_k(n) - T_k(n-1) = \frac{n(n+1)\cdots(n-1+k)}{(k+1)!}\,\big((n+k) - (n-1)\big) = \frac{n(n+1)\cdots(n+k-1)}{k!} = T_{k-1}(n).$$

But from the recursive definition of $S_k(n)$ we have directly $S_k(n) - S_k(n-1) = S_{k-1}(n)$. 
So the result $S_k(n) = T_k(n)$ will follow for given k and n if $S_{k-1}(n) = T_{k-1}(n)$ 
and $S_k(n-1) = T_k(n-1)$. It is sufficient therefore to show that 
$S_k(1) = T_k(1)$ for all k and $S_1(n) = T_1(n)$ for all n. Now i) $S_k(1)$ consists of 
the single term $S_{k-1}(1)$ and so on: $S_k(1) = \cdots = S_1(1) = 1 = T_k(1)$; and ii) 
$S_1(n) = 1 + 2 + \cdots + n$ (by definition), $T_1(n) = n(n+1)/2 = S_1(n)$ (as YB has taken the 
trouble to show geometrically). End of proof. 
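
The two ingredients of the induction can be spot-checked by brute force. In the Python sketch below, added here as an illustration, `S` builds the repeated sums by iterated running totals starting from the natural numbers, and `T` is the product formula written as a binomial coefficient; both names are hypothetical and the ranges of k and n are arbitrary small test values.

```python
from itertools import accumulate
from math import comb

def S(k, n):
    """k-fold repeated sum: S(1, n) = 1 + 2 + ... + n and
    S(k, n) = S(k-1, 1) + ... + S(k-1, n), for k >= 1 and n >= 1."""
    row = list(range(1, n + 1))
    for _ in range(k):
        row = list(accumulate(row))      # running totals of the previous row
    return row[-1]

def T(k, n):
    """n(n+1)...(n+k) / (k+1)!, as the binomial coefficient C(n+k, k+1)."""
    return comb(n + k, k + 1)

# The difference relation used in the inductive step.
step_ok = all(T(k, n) - T(k, n - 1) == T(k - 1, n)
              for k in range(1, 6) for n in range(2, 9))
# The conclusion of the induction on a grid of small values.
agree = all(S(k, n) == T(k, n) for k in range(1, 6) for n in range(1, 9))
print(step_ok, agree)
```

Such a check proves nothing for general k and n, of course; it only confirms that the recursion and the closed form are being read consistently.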

Even if writing out this proof serves no other purpose, it will have convinced the 
reader that it certainly was not beyond the mathematical (and literary) powers of 
YB: it is quite as easy to turn it around and present it in YB's preferred style, starting 
with k = n = 1 and building it up one step at a time in k and n. 

In seeking to identify the reason(s) for the general silence on the yukti, we may 
discount the notion that there was in fact no yukti and that the general formula was 
an extrapolation from small values of k = 1, 2 (known since Aryabhata, Ganita 21 
of AB); the Kerala mathematicians most particularly were not in the habit of stating 
results lacking sound logical support. Sankaran’s enunciation of the general formula 
for $S_k(n)$, more precise than YB’s, is actually followed by the line “The yukti of this 
is not easy to follow (not sugama) and so is not described here”. (The whole verse 
will be found in [6] p.248). And it is preceded by a geometrical demonstration of the 
cases k = 1, 2 involving a clever cutting up of the area and volume of suitable two 
(the same as in YB) and three dimensional figures. As Sarasvati Amma [6] wonders, 
how can such a demonstration be extended to k > 2 “unless one were to conceive 
spaces with more than three dimensions", especially in a work professedly meant 
for “the benefit of the not so intelligent" (alpadhiyam hita)? 

It is tempting to suggest that what the slow-witted really found difficult was
the transition from a long-familiar geometrical and arithmetical mode of reasoning
to one based on combinatorial algebra and, even more disorientingly, on novel
logical uses of recursive thinking that an inductive proof represented. That would
presuppose that such proofs were a genuine breakthrough, born of a conscious
recognition that geometric methods had hit their natural limitations and that new
yukti-techniques were called for. The extreme care with which the first recorded (as
far as we know now) inductive yukti, for the terms in the π series, is constructed
and presented in YB is then fully justified: the sums of powers greater than two
that occur there are as resistant to geometry as the coefficients $S_k(n)$. And if the
only yukti available for the computation of the latter was in fact inductive, we will
have to conclude that Narayana who lived just before Madhavan, we are not certain
where, was an adept of such techniques.

not be generalised since sums of general powers were not known. I got the inductive proof below from
Chandrashekhar Cowsik to whom I express my thanks.

All this only deepens the mystery of the immediate antecedents of Madhavan’s
achievements, he himself having left too little of his own to throw any light on it.
We are all the more fortunate then that so many of his followers turned out to be
such prolific authors.


5. Two Footnotes 




5.1. Decimal Numbers and Polynomials 


One of the most interesting sections of YB not directly tied to calculus is concerned 
with the development of a notation and a proper set of rules for dealing with algebraic 
operations on functions involving powers of one variable. The context in which the 
need for such a general algebraic machinery emerges is that of the estimation of 
the corrections to be applied when the π series is truncated. As described in section
3[…] after i = (j + 1)/2 (j a positive odd integer) terms satisfies
[…] in j and $r_j$ is to be chosen, effectively by trial and error, so as
[…] closely as possible. It is in this connection that YB makes
[…] to the notion of a function, also as noted in section 3.5.
[…] fixed in advance, j is treated as an avyaktarasi, “the
[…] integers. YB says ([Sarma] 6.8, p.74):
So a natural number n in the first place stands for n itself, in the second place
for n × j and so on. When the coefficients are fractions, they can be reduced to
a common denominator, i.e., an overall numerical divisor, and for negative
coefficients, YB cites Brahmagupta's rules for multiplication among numbers “of a
positive or a negative nature” (dhanabhūtam or ṛṇabhūtam). What results is a
notation that characterises a polynomial function of one variable (over the rationals) by
the numbers in the compartments, i.e., by the ordered sequence of its coefficients
which, up to a common denominator, can be positive or negative integers, thus
accounting for YB's sudden interest in Brahmagupta's old rule for multiplication with
negatives.

The functions that turn up in the samskaram work are not polynomials but involve 
(small) powers of j and 1/j. The error after truncation at j: 


$$e^{(j)} = \frac{1}{j} - \frac{1}{r_{j-2}} - \frac{1}{r_j} = -\frac{1}{j^3 - j}$$

for the trial choice $r_{j-2} = 2j - 2$, $r_j = 2j + 2$ is an example. Such functions
are also to be reduced to a common denominator by extending the usual numerical 
method, which involves addition and multiplication, to polynomials. The result 
will be a rational function of which the numerator and denominator are treated 
independently. The algebraic needs of the operations on the functions occurring in 
all the variants of the correction techniques met with in section 3.5 are thus fully 
met if the operations of addition and multiplication are defined for polynomials. YB 
proceeds to explain how that is to be done and understood with the help of relevant 
examples. I quote the first part of that passage ([Sarma] 6.8, p.75). 


Here [I] show how the work is to be done when the unknown (rasi, the reader
will have noted that this word is used in two different senses in this section) is
taken to be the last odd number (the truncation point j). Make (literally, “write” or
“draw”) two rows of compartments (khandam) so that each compartment encloses
one place. In this the upper row is for the numerators and the lower […]
denominators. The odd number (the unknown j) is |1/0| (have […]
to separate the compartments). If the rasi is negative (since j is […]
what is meant here are the coefficients of powers of j) it must be […]
some mark. For zero, whatever thing is used for […]


The divisor in the first correction term (1/(2j − 2 […]
The passage continues with a description of the multiplication and division of
a polynomial by a number (obvious) and the multiplication of two polynomials 
(relying on the fact that j is actually a number though unknown). There are no 
surprises here; the time and care spent on the explanation are perhaps, once again, 
warranted by the conceptual innovation. 
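The kind of bookkeeping described here can be imitated with a polynomial stored as its row of coefficients. The following Python sketch is a modern illustration under stated assumptions, not a transcription of YB's procedure: coefficients are listed lowest power first (an arbitrary choice made for the sketch), a rational function is a pair (numerator row, denominator row), and all function names are hypothetical. It reduces the combination 1/j − 1/(2j − 2) − 1/(2j + 2), built from the trial choice quoted above, to a common denominator.

```python
from functools import reduce
from math import gcd


def trim(p):
    """Drop vanishing highest-power coefficients, keeping at least one entry."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def add(p, q):
    """Add two coefficient rows place by place (no carrying is needed)."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim([a + b for a, b in zip(p, q)])


def mul(p, q):
    """Multiply two coefficient rows as in long multiplication, without carries."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for k, b in enumerate(q):
            out[i + k] += a * b
    return trim(out)


def rat_add(f, g):
    """Add two rational functions given as (numerator, denominator) rows."""
    (fn, fd), (gn, gd) = f, g
    return add(mul(fn, gd), mul(gn, fd)), mul(fd, gd)


def remove_common_factor(f):
    """Divide numerator and denominator by their common numerical factor."""
    num, den = f
    g = reduce(gcd, num + den)
    return [a // g for a in num], [a // g for a in den]


# 1/j, -1/(2j - 2), -1/(2j + 2), with rows listed lowest power first.
terms = [([1], [0, 1]), ([-1], [-2, 2]), ([-1], [2, 2])]
error = remove_common_factor(reduce(rat_add, terms))
```

Here `error` comes out as the pair `([-1], [0, -1, 0, 1])`, that is −1/(j³ − j). Numerator and denominator are handled independently, and only addition and multiplication of coefficient rows are needed.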

At this early stage of our appreciation of the riches to be mined from a detailed 
exploration of YB, there is nothing much to add except perhaps to stress again 
how closely the notation for and the algebraic operations with polynomials are 
modelled on the writing of numbers in the decimal place value notation and
arithmetical operations with them. A polynomial is uniquely characterised by the 
coefficients of powers of the unknown. The unknown (variable) is left implicit just 
as the base 10 is in the decimal notation; it is not assigned a symbol but has a 
name, (avyakta)rasi. The only difference is that the coefficients are from the ring 
of integers and negative coefficients have to be explicitly distinguished whereas in 
the decimal representation of numbers, each coefficient is a nonnegative integer 
modulo 10 — there are no carry over rules in the algebra of polynomials. We may 
nevertheless feel confident in the guess [16] that, for the Kerala mathematicians,
the decimal numbers served as a bridge not only to the old idea of the infinite, but
also to the acceptance of infinite series both numerical and functional, just as they
did for Newton (section 1.4).


5.2. The Roots of Samskaram, Roots by Samskaram 


How far back in antiquity can we trace recursive computational techniques of 
the sort we have come across in YB, more particularly in their samskaram form? 
There are very credible indications that the idea of making an initial incorrect 
guess at the solution of a problem which is then sequentially refined to any desired 
degree of accuracy is very old indeed, as old as the earliest recorded mathematics 
in India, the Sulbasutra manuals (8th-7th C. BCE) [31]. Three of them, those of 
Baudhayana, Apastamba and Katyayana, have an identically expressed rational 
numerical approximation for the square root of 2 which is most easily and logically 
obtained (the specific arithmetical formula given in the texts is a good guide in this) 
by an exercise in the method of samskaram. The technique also seems to be at the 
back of a rational algebraic formula for the approximate square root of a general 
nonsquare positive integer n found in the much later Bakhshali manuscript. It seems 
best to discuss these together, but in anti-chronological order, beginning with the 
general case.

The period of the Bakhshali manuscript has been a contentious issue, various
scholars suggesting various dates ranging between the 1st and the 13th centuries CE.
According to Takao Hayashi whose critical edition [15] contains the most thorough
study we have of the work, it was most likely composed in the 7th century. (That
makes it substantially more than a thousand years younger than the Sulba manuals.)
Our interest is in one sūtra (sūtra Q2 in Hayashi's nomenclature) which is cited on
more than one occasion in the surviving fragment of the text and which contains the
formula mentioned above. The literal meaning of the verse is not unambiguous but
there is unanimity on its mathematical content (see Hayashi [15]):


$$\sqrt{n} = m + \frac{r}{2m} - \frac{\left(\dfrac{r}{2m}\right)^2}{2\left(m + \dfrac{r}{2m}\right)},$$


where $m^2$ is the square number nearest to and less than n and r is the difference
$n - m^2$. Bakhshali of course gives no proof or explanation but it is difficult to look
at this formula without immediately recognising its samskaram credentials. In that
spirit, the obvious first guess is $x_1 := (\sqrt{n})_1 = m$ so that r is a measure of the error.
It is easy to reexpress n as the difference of two squares:


$$n = \left(m + \frac{r}{2m}\right)^2 - \left(\frac{r}{2m}\right)^2,$$

a type of reordering very popular in Indian mathematics. The next approximation 


$x_2$ is obtained by neglecting the second (and second order) term in the above exact
formula:

$$x_2 := (\sqrt{n})_2 = m + \frac{r}{2m}.$$

Now define $r'$ to be the difference between the exact value and the approximant $x_2$:


$$(m^2 + r)^{1/2} = m + \frac{r}{2m} + r',$$

which equation, on squaring and neglecting $r'^2$ but not $(r/2m)^2$, leads to

$$r' = -\frac{(r/2m)^2}{2\,(m + r/2m)}$$


and hence to the Bakhshali formula as $x_3$ in this sequence of approximations. The
question of course is, is this how the author of sūtra Q2 arrived at the formula?
Since it is not obviously a result that should be “self-evident to the intelligent”
(Nīlakanthan, alluding to the theorem of the diagonal), there presumably was a yukti
and going by the choice of the ‘variables’ (the nearest perfect square and its deviation
from the required answer), we can at least tentatively assume that it was a recursive
one, not very different from the one above. Indeed just such a justification has
been suggested by commentators, the closest to the method of samskāram (though
without drawing the parallel) being that of Channabasappa [32].
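The conjectured recursive yukti can be tried out directly. The Python sketch below is an illustration only: the step "absorb the current error to first order, neglecting its square" is written as the hypothetical function `refine`, exact rational arithmetic is used throughout, and nothing here is claimed to be how the author of sūtra Q2 actually worked.

```python
from fractions import Fraction
from math import isqrt


def refine(n: int, x: Fraction) -> Fraction:
    """One refining step: with error r = n - x^2, replace x by x + r/(2x)."""
    x = Fraction(x)
    return x + (n - x * x) / (2 * x)


def approximations(n: int, steps: int) -> list:
    """x_1 = m (largest integer with m^2 <= n), followed by `steps` refinements."""
    x = Fraction(isqrt(n))
    out = [x]
    for _ in range(steps):
        x = refine(n, x)
        out.append(x)
    return out


def bakhshali(n: int) -> Fraction:
    """The three-term formula m + r/(2m) - (r/(2m))^2 / (2(m + r/(2m)))."""
    m = isqrt(n)
    r = n - m * m
    first = Fraction(r, 2 * m)
    return m + first - first ** 2 / (2 * (m + first))
```

For any nonsquare n, the third entry of `approximations(n, 2)` equals `bakhshali(n)`; for n = 41, for instance, both are 6 + 5/12 − (5/12)²/(2 · (6 + 5/12)).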

The iteration can be carried forward indefinitely by consistently neglecting the
quadratic term in the error at each stage, again in the general spirit of recursive
refining. The result is easily expressible in an explicit infinitely recursive formula;
a particularly neat and compact formulation is given in Hayashi [15], formula
(11.1).⁵¹ What is interesting now is that we can also evaluate $\sqrt{n}$ to arbitrary
accuracy by expanding $\sqrt{n} = \bigl((m + r/(2m))^2 - (r/(2m))^2\bigr)^{1/2}$ in powers of the
second term in a (convergent) binomial series. The two series are not the same
(successive terms have, in general, different structures for the denominators) and
there is no reason why they should be; they are expansions in different variables.
Nevertheless, the first three terms of the binomial series coincide exactly with the
three terms in the Bakhshali formula. The first moral to be drawn from this is
that the absence of a general binomial theorem was not much of a handicap in
finding alternative (convergent in this case) infinite expansions, as noted earlier (the
end of section 3.3). The second comes from the similarity and dissimilarity with
Madhavan’s interpolation formula (section 4.3) which also agrees with the Taylor
series in the first two terms: whether a given formal iterative expansion is a valid
one is to be independently checked in each problem.
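The agreement of the first three terms, and the parting of ways at the fourth, can be seen in a small computation. The Python sketch below is illustrative only. It assumes the refining step x → x + (n − x²)/(2x) for the recursive series and the standard binomial expansion of A(1 − u)^{1/2} with A = m + r/(2m) and u = (r/(2m))²/A². The function names are hypothetical.

```python
from fractions import Fraction


def samskaram_terms(n: int, m: int, count: int) -> list:
    """m followed by the successive corrections of the recursive refinement."""
    x = Fraction(m)
    terms = [x]
    for _ in range(count - 1):
        step = (n - x * x) / (2 * x)
        terms.append(step)
        x += step
    return terms


def binomial_terms(n: int, m: int, count: int) -> list:
    """m, r/(2m), then the terms of A*(1 - u)^(1/2) beyond the leading A."""
    r = n - m * m
    b = Fraction(r, 2 * m)
    a = m + b
    u = (b / a) ** 2
    terms = [Fraction(m), b]
    coeff = Fraction(1)  # coefficient of u^j in (1 - u)^(1/2)
    for j in range(1, count - 1):
        coeff *= (Fraction(1, 2) - (j - 1)) / j * (-1)
        terms.append(a * coeff * u ** j)
    return terms
```

For n = 2 (m = 1) both lists begin 1, 1/2, −1/12. The fourth entries are −1/408 for the recursive series and −1/432 for the binomial one.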

YB does not refer to the Bakhshali formula; it has no occasion to. But given the 
uncanny similarity of the only yukti we know for it to the general philosophy of 
samskaram, we are obliged to make doubly sure that this affinity is not something 
we have imposed on the actual, bare-bones, formula by overinterpreting it. In other 
words, how can we be sure that it was thought of as the starting point of an exact, 
endlessly recursive, procedure as the paragraphs above (and Hayashi’s formula 
(11.1)) suggest, and not just as a clever and empirically reliable approximation 
without a general theoretical basis? It is here that we can turn for help to the
numerical Sulbasūtra formula for $\sqrt{2}$. That formula is (literally reading the verses
for the various numbers involved)

$$\sqrt{2} = 1 + \frac{1}{3} + \frac{1}{3 \cdot 4} - \frac{1}{3 \cdot 4 \cdot 34}.$$


On the other hand we can apply the Bakhshali formula to $\sqrt{2}$ (m = 1, r = 1) to get

$$\sqrt{2} = 1 + \frac{1}{2} - \frac{1}{12},$$


which is numerically equal to the sum of the first three terms of the Sulbasūtra
formula. Even more strikingly, the next samskaram in our conjectured general yukti
for Bakhshali leads to a correction term which is −1/408, again identical with the
4th Sulbasūtra term (discounting the particular factorisation employed).
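The numerical coincidence is easy to confirm with exact fractions. In the Python sketch below the Sulbasūtra terms are taken with the denominators 3, 3 × 4 and 3 × 4 × 34 quoted in the text, and the "next samskaram" is assumed to be the first-order correction (n − x²)/(2x). The variable and function names are illustrative.

```python
from fractions import Fraction

# The four Sulbasutra terms for the square root of 2.
sulba_terms = [
    Fraction(1),
    Fraction(1, 3),
    Fraction(1, 3 * 4),
    Fraction(-1, 3 * 4 * 34),
]

# Bakhshali formula with m = 1, r = 1.
first = Fraction(1, 2)
bakhshali_value = 1 + first - first ** 2 / (2 * (1 + first))


def next_correction(n: int, x: Fraction) -> Fraction:
    """First-order correction to x as an approximation to sqrt(n)."""
    return (n - x * x) / (2 * x)


fourth = next_correction(2, bakhshali_value)
```

Both three-term sums equal 17/12, the computed `fourth` equals −1/408, and the full Sulbasūtra sum is 577/408.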

The book of Sen and Bag describes several suggested reconstructions of the
Sulbasūtra formula for $\sqrt{2}$, without mentioning the general Bakhshali formula.
None of them agrees in every step with our conjectured Bakhshali yukti. But one
due to L. Rodet, published in 1879, two years before the Bakhshali manuscript
was even unearthed, comes very close. It starts with the first guess $\sqrt{2} = 1 + 1/3$
(rather than just 1) and in two steps of numerical samskaram gets to the Sulbasūtra
approximation. It is obvious that Rodet chose the first two terms to reproduce
the 1/3 term and it is equally obvious that the dharma of samskaram imposes
no conditions on the first guess except those of practical utility (economy) and
theoretical soundness (convergence, for instance). In any case, despite the numerical
coincidence seen above, there is no way to be absolutely sure that the putative
yukti(s) of our two sources were the same. The numbers which appear as the various
terms and factors may or may not point to a particular method of arriving at the
result; it is not impossible for example (though it appears unlikely) that the choice
of the denominators 3, 3 × 4 and 3 × 4 × 34, identically expressed in all three
Sulba texts dealing with the question, was dictated by the demands of metrical
composition.

51 Just to avoid any misunderstanding, such a general formula does not occur anywhere in Bakhshali.

It is also something of a surprise that it is the earlier text that carries the refining 
one step further than the later. Here again, it may well be that the author of sutra 
Q2, having got an accurate enough three-term formula for reasonably small r/(2m), 
was daunted by the task of expressing in verse the fourth term, a fairly complicated 
rational expression involving two variables (m and r) and several numbers, rather 
than by mathematical difficulty. Not every mathematical versifier is a Sankara 
Variyar. 

With reasonable confidence, we may conclude from all this that the idea of 
recursion as captured in the method of successive refining has been a constant of 
Indian mathematical thinking and technique over the two millennia separating YB 
from the late vedic era. The earliest Sulbasutras were probably assembled some time 
after the compilation of the particular version of the Yajurveda containing the famous 
list of powers of 10 and the knowledge they display may well be contemporaneous 
with, or even anterior to, that text. As for Bakhshali, that comes, very likely, soon 
after those number-happy centuries when naming and listing impressively large 
numbers had become an end in itself. (A case can be made for its date to be
somewhat earlier than Hayashi’s, on the basis of what is reasonably certain about 
the political history of northwest India). The intervening period is surely one in 
which Indian thinkers, in addition to mastering arithmetic, lost their fear of infinity 
as the philosophical and canonical (including the astronomical) literature of Jainism, 
Buddhism and the Hinduism of the time demonstrates. From the recursive infinitude 
of numbers-with-a-base, it is a natural step to power series, a step taken also by 
Newton in his turn and, more generally, to the idea that a recursive computation 
need not end for it to make sense, that a well-defined quantity such as a number or 
the length of an arc can be expressed meaningfully in an unending series of terms. 
And the liberation probably came early — it is notable that all three of the Sulba 
manuals that give the approximate value of $\sqrt{2}$ end their enunciation of the formula
with the phrase savisesah, “with remainder”. 
Thus, born out of the systematics of the decimal number nomenclature and
marking its first success as a clever scheme of approximation in arithmetical
problems, recursive thinking acquired such depth and richness with the evolution of
mathematical ideas that, by the time of Madhavan, it would seem that mathematics
in every one of its aspects, conceptual, computational and logical, was almost
inconceivable except in fundamentally recursive terms. Among the milestones which
mark this journey three are particularly significant: the decimal numbers and the
development of arithmetic of course, Aryabhata’s trigonometry and the sine table,
and the invention and elaboration of calculus. Indeed a reading of Yuktibhasa makes
one wonder whether Madhavan’s calculus would at all have been possible without
the power and subtlety that these methods had by then acquired.

This work has been made possible by a Senior Award of the Homi Bhabha Fellow- 
ships Council and the hospitality of the IUCAA, Pune, to both of which institutions I 
express my thanks. From among friends and colleagues whose specific suggestions 
Tantrasangraha (c.1500) and
Ganita-Yukti-bhasa (c.1530)
1. Introduction


Aryabhatiya (c.499) is the first extant text on mathematical astronomy in India to
discuss the explicit algorithms for the computation of planetary positions [1].
Eccentric/epicyclic models are somewhat implicit in these algorithms in Aryabhatiya,
and in the texts which followed. In contrast, the planetary theory in Ptolemy’s
Almagest [2] and the later Islamic works which were influenced by Ptolemy, as also
Copernicus’s De Revolutionibus [3], are based on explicit geometrical models.
In essence, the motion of any planet is viewed as a combination of uniform circular
motions in these models. No such rigid principles are insisted upon in Aryabhatiya
or even in the later Indian texts. Though most texts do not discuss the geometrical
models at all, there are exceptions. Nilakantha Somayaji, who introduced a major
revision of the traditional Indian planetary theory in his Tantrasangraha [4,5,6],
outlined a geometrical model of planetary motion in his other works, Golasara,
Siddhantadarpana and Aryabhatiya-bhasya. In this model, planets move in eccentric
orbits inclined to the ecliptic around the mean Sun, which in turn goes around the
earth [7,8]. We have a detailed account of this model in Ganita-Yukti-bhasa,
popularly known as Yukti-bhasa (c.1530), authored by Jyesthadeva [9]. From here onwards,
[9] will be indicated by just Ganita-Yukti-bhasa in all the specific references to the
passages in it.

Apart from discussing the geometrical picture of planetary motion, Yukti-bhasa
also presents an interesting geometrical model of lunar motion while discussing
the so-called ‘dvitīya-sphuta’ or the ‘second correction’. This correction, termed
‘Evection’, was first introduced by Ptolemy in his Almagest, where he proposed a
geometrical model for the Moon's longitude, which incorporates evection, apart from
the correction due to the eccentricity of the orbit. Ptolemy's lunar model had a glaring
defect as the ratio of the maximum and minimum apparent angular diameters of the
Moon would be around 1.92 in this model, which is patently absurd. This deficiency
of Ptolemy's lunar theory had been rectified in some later Islamic works which
otherwise follow the Ptolemaic framework, as for instance in Ibn al-Shatir’s (1304-1375/6
CE) lunar theory, which seems to have been adopted by Copernicus also, except for
a small change in parameters [10]. In the Indian tradition, the second correction to
the Moon makes its first appearance in Laghumanasa of Mañjulācārya (c.932)
who essentially gave an analytical form for it [11]. Tantrasangraha also gives the
algorithm for the second correction which has the same form as in Laghumanasa,
but goes ahead to give the formula for the true distance of the Moon, incorporating
this. We have an explicit geometrical model for the same in Yukti-bhasa. This model
does not show any problem of unreasonable variation in the earth-Moon distance
that is associated with Ptolemy's lunar theory. The corrections to the Moon's longitude
as given in the lunar theories of both Tantrasangraha/Yukti-bhasa and Ibn al-Shatir
compare reasonably with the modern expression for the two corrections, though the
geometrical models are very different. Also, the patterns of variation of the lunar
distance are different in the two models.

In section 2, we discuss the salient features of planetary computations in Indian
siddhantas beginning with Aryabhatiya. In section 3, we provide a brief outline of the
contents of Tantrasangraha and Yukti-bhasa. The planetary model in Yukti-bhasa is
described in detail in section 4. For the sake of comparison, the geocentric position
of a planet in Kepler's model is discussed in Appendix 1. In section 5, we discuss the
lunar theory of Yukti-bhasa and briefly compare it with the lunar theories of Ptolemy
and Ibn al-Shatir which are outlined in Appendices 2 and 3 respectively.


2. Planetary Computations in Indian Siddhantas 


The Siddhanta period in Indian astronomy [12] begins with Aryabhatiya which
was composed in AD 499. There were indeed earlier siddhantas which have been
quoted by the later astronomers but are not available today. Mathematics
developed with astronomy in India. Siddhantas contain correct mathematical solutions
to astronomical problems, namely calculation of positions of planets (Sun, Moon,
Mercury, Venus, Mars, Jupiter and Saturn), and diurnal problems involving
spherical astronomy, and eclipses. The diurnal problems include, finding north-south
[…] latitude of a place, Sun's diurnal path, its declination, determination of
[…] sunrise/sunset, measurement of time (from shadow), relations among
[…] calculation of lagna (point on the ecliptic which is on
[…] half-durations (from the first contact to the middle,
and from the middle to the last contact), duration of totality and the magnitude of
the eclipse, and so on.

Table 1: Planetary revolutions in a Mahayuga as given in Aryabhatiya, and the
inferred sidereal periods. * The revolution numbers of Mercury and Venus refer to
their sighroccas (heliocentric values).

| Planet        | No. of revolutions in a Mahayuga | Period of revolution in days (sidereal period) | Modern value |
|---------------|----------------------------------|------------------------------------------------|--------------|
| Sun           | 43,20,000                        | 365.25868                                      | 365.25636    |
| Moon          | 5,77,53,336                      | 27.32167                                       | 27.32166     |
| Moon’s apogee | 4,88,219                         | 3231.98708                                     | 3232.37543   |
| Moon’s nodes  | 2,32,226                         | 6794.74951                                     | 6793.39108   |
| Mercury*      | 1,79,37,020                      | 87.96988                                       | 87.96930     |
| Venus*        | 70,22,388                        | 224.69814                                      | 224.70080    |
| Mars          | 22,96,824                        | 686.99974                                      | 686.97970    |
| Jupiter       | 3,64,224                         | 4332.27217                                     | 4332.58870   |
| Saturn        | 1,46,564                         | 10766.06465                                    | 10759.20100  |

Aryabhatiya is the earliest available siddhanta text in India, which contains 
a systematic account of most of the traditional astronomical problems [1]. It is 
mentioned in the text itself that it was composed 3600 years after the beginning 
of Kaliyuga corresponding to AD 499 when it is stated that the author was 23 
years of age. Aryabhata composed this work in Kusumapura, which is the same
as Pataliputra (near modern Patna). From Aryabhatiya onwards, one considers a 
Mahayuga of 43,20,000 years. The number of revolutions in the stellar background 
made by the planets in the Mahayuga are given in all the texts (see Table 1). 

In the case of Mercury and Venus, the texts give the revolution numbers of 
their sighrocca-s, which are nothing but their heliocentric revolution numbers. The 
number of civil days in a Mahayuga known as the Yugasavanadina is also specified. 
In Aryabhatiya, its value is given as 1,57,79,17,500.

From the above data, the mean longitudes of the planets can be calculated for any 
time. Normally, it is assumed that the mean longitudes are zero at the beginning of 
kaliyuga. In Aryabhatiya, this is taken to be the mean sunrise at Ujjain on February 
18, BC 3102. 
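The arithmetic behind these statements is simple enough to sketch. The Python fragment below is an illustration in modern terms, not an algorithm quoted from any text: it takes the revolution numbers of Table 1 and the Aryabhatiya value of the yugasavanadina, obtains the period as (civil days)/(revolutions), and computes a mean longitude under the stated assumption that it is zero at the epoch. The names `CIVIL_DAYS`, `REVOLUTIONS`, `sidereal_period` and `mean_longitude` are hypothetical.

```python
from fractions import Fraction

CIVIL_DAYS = 1_577_917_500  # civil days in a Mahayuga (1,57,79,17,500)

# Revolutions in a Mahayuga, from Table 1 (Mercury and Venus: sighroccas).
REVOLUTIONS = {
    "Sun": 4_320_000,
    "Moon": 57_753_336,
    "Moon's apogee": 488_219,
    "Moon's nodes": 232_226,
    "Mercury": 17_937_020,
    "Venus": 7_022_388,
    "Mars": 2_296_824,
    "Jupiter": 364_224,
    "Saturn": 146_564,
}


def sidereal_period(body: str) -> float:
    """Days per revolution implied by the Mahayuga numbers."""
    return CIVIL_DAYS / REVOLUTIONS[body]


def mean_longitude(body: str, days_since_epoch) -> float:
    """Mean longitude in degrees, assuming it is zero at the epoch."""
    revs = Fraction(REVOLUTIONS[body]) * Fraction(days_since_epoch) / CIVIL_DAYS
    return float((revs % 1) * 360)
```

For example, `sidereal_period("Sun")` reproduces the 365.25868 days of Table 1 to the quoted precision, and `mean_longitude("Sun", Fraction(CIVIL_DAYS, 2 * 4_320_000))`, half a sidereal year after the epoch, is 180 degrees.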

Now, the apparent motion of the Sun, Moon and planets in the background 
of stars is not uniform. The planets (including the earth) and the Moon move in 
elliptical orbits around the Sun, and the earth respectively. The ellipticity of the 
orbit gives rise to the non-uniform nature of the motion. This is essentially Kepler’s 
picture. In the geocentric frame of reference, the Sun moves around the earth, and 
the geocentric longitude of the Sun would be the heliocentric longitude of the earth 
increased by 180°. The geocentric longitudes of the planets would be related to their 
heliocentric longitudes and the geocentric longitude of the Sun. 
Even in the ancient times, the predictions of planetary positions were fairly
accurate. This means that the ancient models would have in some sense been 
approximations of the Kepler model in the geocentric frame [13]. In the Indian 
planetary theories, two corrections were applied to the mean planet to obtain the 
‘true’ geocentric longitudes, and they are described below: 


(1) Manda-samskara: This takes into account the non-uniformity of motion
resulting from the eccentricity of the planet's orbit. An epicyclic or eccentric
model is used to obtain the manda-correction. This corresponds to the
‘Equation of centre’ in modern terminology. The manda-corrected mean
planet is called mandasphuta-graha or simply mandasphuta. This is the
true geocentric longitude in the case of the Sun. For the Moon, a second
correction has been specified in some later texts to obtain the geocentric
longitude, which will be discussed later. In the case of the planets called
taragraha-s (traditionally, only Mercury, Venus, Mars, Jupiter and Saturn),
the mandasphuta is the true heliocentric longitude.


(2) Sighra-samskara: For the planets, one more correction, namely sighra, has
to be applied to obtain the true longitude called sphutagraha. The sighra-samskara
converts the heliocentric longitudes of the planets to geocentric
longitudes.
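In modern terms, the conversion performed by the second correction amounts to adding the planet's heliocentric position to the Sun's geocentric position. The Python sketch below illustrates only that geometric idea under simplifying assumptions (circular, coplanar orbits; radii in arbitrary units, with the Sun's distance defaulting to 1). It is not the traditional sighra algorithm, and the function name and parameters are hypothetical.

```python
import math


def geocentric_longitude(sun_long_deg: float,
                         helio_long_deg: float,
                         planet_radius: float,
                         sun_radius: float = 1.0) -> float:
    """Geocentric longitude (degrees) from vector addition in the ecliptic plane.

    sun_long_deg   : geocentric longitude of the Sun
    helio_long_deg : heliocentric longitude of the planet
    planet_radius  : assumed Sun-planet distance
    sun_radius     : assumed earth-Sun distance (same units)
    """
    s = math.radians(sun_long_deg)
    p = math.radians(helio_long_deg)
    x = sun_radius * math.cos(s) + planet_radius * math.cos(p)
    y = sun_radius * math.sin(s) + planet_radius * math.sin(p)
    return math.degrees(math.atan2(y, x)) % 360.0
```

With the Sun at longitude 30° and an interior planet (radius 0.7) at heliocentric longitude 30°, the planet is seen in the Sun's direction. An exterior planet (radius 1.5) at heliocentric longitude 210° is seen opposite the Sun, at 210°.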


In Aryabhatiya, the above two corrections have been discussed clearly. The 
planetary model described by Aryabhata roughly amounts to the planets orbiting 
around the Sun in eccentric orbits, with the Sun itself orbiting around the earth, 
though Aryabhata does not state it as such. We discuss the Indian planetary models, 
in par[…] traditional confusion in taking the mean Sun as the mean planet in
[…] of planets is broadly correct in […]

[…] mention here some of the later astronomers and […]
Varahamihira (b. AD 505) (Pancasiddhantika), Bhaskara I
([…]riya, Aryabhatiya-bhasya), Brahmagupta (7th century)
[…]la ([…] century) (Sisyadhivrddhida-tantra),
[…] some major innovations in planetary theory. We will discuss them in the next
section.


3. Tantrasangraha and Yukti-bhasa 


Kerala was a stronghold of the Aryabhatan school of astronomy [14]. Aryabhatiya
became popular in Kerala from early times. Madhava of Sangamagrama (1340-1425)
seems to have been the founder of the Kerala school. His known works are few
in number. However, most of the distinctive contributions of the Kerala school
in mathematical analysis, as well as in planetary computations, are invariably
attributed to him by the later astronomers from Kerala. Paramesvara of Vatasseri
(c.1360-1455), Nilakantha Somayaji or Somasutvan of Trikkantiyur (c.1444-1550)
and Jyesthadeva of Parakroda or Parannottu family (c.1500-1610), are some of the
major figures of this school. Emphasizing the need for revising the planetary
parameters through observations, Paramesvara introduced the Drgganita system. Apart
from Drgganita, his other major works are Bhatadipika which is a commentary on
Aryabhatiya, Siddhantadipika, and Grahanamandana on eclipses. He was one of
the first astronomers to discuss the geometrical model of planetary motion in some
detail.

Tantrasangraha, composed in AD 1500, is Nilakantha's major work.
Aryabhatiya-bhasya, composed by him late in his life, is perhaps the most
elaborate commentary on Aryabhatiya, and is yet to be translated and studied in depth.
Jyotirmimamsa, Siddhanta-darpana and Golasara are some of his other important
works. Jyesthadeva is the author of Ganita-yukti-bhasa. He authored one more work
called Drkkarna.


Tantrasangraha 


Tantrasangraha is a comprehensive text which discusses all aspects of
mathematical astronomy such as the computation of longitudes and latitudes of
planets, various diurnal problems, determination of time, eclipses, visibility of planets
etc. However, explanations are not provided, save on some rare occasions, as this
belongs to the Tantra class of texts which are intended to be computational in
nature. The explanations of the algorithms are to be found in Yukti-bhasa, and
in the two commentaries of the text, namely Laghuvivrti and Yukti-dipika, both
by Sankara Variyar of Trikkutaveli (c.1500-1562). In what follows, we
summarise those portions of Tantrasangraha which deal with planetary and lunar
theory.

The revolution numbers of the Sun, Moon and the five planets given in this text 
differ little from the Aryabhatan values summarised in Table 1. The yugasavanadina 
or the number of civil days in a yuga is also the same as in Aryabhatiya. However, it 



358 M. S. Sriram 


is noteworthy that while specifying the number of revolutions of the interior planets, 
Mercury and Venus, the word svaparyayah is used, which clearly means that the 
revolution numbers given refer to their own revolutions, and not to their sighroccas, 
as specified in the earlier texts. This is of great significance in the computation of 
longitudes of the inner planets, and will be explained later. 

The revolution numbers of the apsides and nodes of the planets which are needed 
for calculating the true longitudes and latitudes are also given. The mean longitudes 
at the beginning of kaliyuga are not assumed to be zero, as in Aryabhatiya, and their
corrected values are specified. The mean longitude of a planet can be calculated 
at any given time, given these epochal values, the revolution numbers and the 
yugasavanadina. 

In the earlier Indian texts, for the interior planets Mercury and Venus, the
equation-of-centre correction was wrongly applied to the mean Sun, instead of
the mean heliocentric planet. This is true of Ptolemy's Almagest also, and clearly
this correction does not correspond to anything physical. It was Nilakantha who
set this right in Tantrasangraha. Here the manda correction is applied to the mean
heliocentric planet for the interior planets also, just as in the case of exterior planets.
This departure, as well as a clear analysis of the latitudinal motion, led Nilakantha
to propose a geometrical picture of planetary motion in his other works. According
to this, the planets move in eccentric orbits around the mean Sun, which itself moves
around the earth [7,8].

The actual distances of the Sun and the Moon from the centre of the earth play 
an important role in eclipse calculations, as these determine the apparent sizes 
of the solar and lunar discs, as well as their parallaxes. This requires accurate 
computations of the positions of the Sun and the Moon. It is in this context that 
the second correction for the Moon is discussed in Tantrasangraha. There is a
correction for the Sun, which does not alter its longitude, but modifies its
distance. The mean distances are specified in the text. The true distance,
called dvitiya-sphuta-yojana-karna, is to be computed by including the manda
correction and the above second correction.

Yukti-bhasa can be naturally divided into mathematics and astronomy portions.
The mathematics part is really an independent treatise. A substantial part of math- 
ematics known to Indians at that time is included and explained in the work. The 
famous results on the infinite series for π, inverse tangent, sine and cosine functions
are all proved here. It also includes a derivation of the surface area and volume of a 
sphere using methods which amount to carrying out integrations. 

The astronomy part is very comprehensive, and includes almost all that was 
known in India then, save some topics like instruments. The author's stated aim is 
to explain all the algorithms in Tantrasangraha.

The first chapter in the astronomy part is on planetary theory. The epicyclic and 
eccentric theories are described for both manda-samskara (equation of centre) and 
Sighra-samskara (conversion from heliocentric to geocentric coordinates). The 
correct formulation of the equation of centre for interior planets is presented and 
the way it differs from the earlier formulations is noted. The geometrical model 
of Nilakantha is described in detail. The computation of geocentric latitudes is 
also described. The orbit of a planet has to be projected onto the plane of the 
ecliptic to obtain the longitude. This is explained. There is an interesting discussion 
on how one can obtain the geocentric coordinates of a celestial body, given the 
luni-centric coordinates. This indicates how comfortable Jyesthadeva was with 
coordinate transformations while going from one frame of reference to another, in 
the context of astronomy. 

The second correction for the Moon is dealt with in a later chapter. The 
Tantrasangraha results for this correction as well as the distance of the Moon 
are explained with a geometrical model. 


4. Planetary Model in Yukti-bhasa 


Manda-samskara: Correction Due to Eccentricity of the Orbit in Indian
Planetary Models


We consider the explicit geometrical construction for computing the correction 
due to eccentricity described in Yukti-bhasa [15]. Essentially the same form of the 
correction term has been described right from Aryabhatiya down to Tantrasangraha.
The speciality of Yukti-bhasa lies in its detailed geometrical description. 

In Figure 1, the dashed circle with O as the centre is the deferent circle, termed
'kaksyavrtta'. Its radius is 'trijya', denoted by R. In the Indian texts, the distances in
diagrams associated with planetary motion are expressed in minutes. Normally, the
circumference of the deferent circle is taken to be 21,600 minutes, and thus the radius
R would be the number of minutes in a radian, which is nearly 3438. Throughout
this article, this is the value of R, wherever it appears. A represents the direction of
the apside ('mandocca') and let $\Gamma\hat{O}A = \varpi$. The mean planet $P_0$ moves along the
deferent circle uniformly. The mean longitude is called 'madhyamagraha' and is
denoted by $\theta_m = \Gamma\hat{O}P_0$.

[Figure 1. Surviving labels: Pratimandala or Grahavrtta.]

Construct a circle of radius r with $P_0$ as the centre. This is the
epicycle ('manda-nicocca-vrtta' or 'manda-vrtta') on which the 'manda'-corrected
planet ('mandasphuta') is located. Draw a line from $P_0$ parallel to OA,
which intersects the epicycle at P. This is the location of the manda-corrected planet,
whose longitude is denoted by $\theta_{ms} = \Gamma\hat{O}P$. [...] elegant geometrical method [16].

Draw PQ perpendicular to (extended) $OP_0$. Now, $PP_0 = r$, and
$P\hat{P}_0Q = \Gamma\hat{O}P_0 - \Gamma\hat{O}A = \theta_m - \varpi = M$ is the
anomaly or manda-kendra, while $P\hat{O}P_0 = \Gamma\hat{O}P_0 - \Gamma\hat{O}P = \theta_m - \theta_{ms}$. Now

$$PQ = r\sin(P\hat{P}_0Q) = OP\sin(P\hat{O}P_0). \qquad (1)$$

K = OP is the 'manda'-hypotenuse, called 'mandakarna'. It is easily seen that

$$K = \left[(R + r\cos M)^2 + r^2\sin^2 M\right]^{1/2}. \qquad (2)$$

Yukti-bhasa follows the standard rule adopted in most texts of Indian astronomy
(such as Aryabhatiya, Brahmasphutasiddhanta, Tantrasangraha, etc.), that r is not
a constant but is proportional to K, such that

$$\frac{r}{K} = \frac{r_0}{R}, \qquad (3)$$

where $r_0$ is the specified value of the radius of
the manda-epicycle, and could itself depend on the anomaly. [...]
K depends on r, and r is proportional to K. [...] an
iterative procedure is given to find K [...].

Taking (3) into account, (1) gives

$$\sin(\theta_{ms} - \theta_m) = -\frac{r_0}{R}\sin(\theta_m - \varpi) = -\frac{r_0}{R}\sin M. \qquad (5)$$


Thus, one does not need to know the manda-hypotenuse in order to find the
correction $\theta_{ms} - \theta_m$. As the mean longitude $\theta_m$, which varies linearly with time,
can be readily found at any instant, and the apside $\varpi$ is known, the 'manda'-correction
('mandaphala'), $\theta_{ms} - \theta_m$, can be found from the above equation. Adding this to
the mean longitude $\theta_m$, we obtain the 'manda'-corrected planet, $\theta_{ms}$. This is the
true longitude or the 'sphutagraha' in the case of the Sun, and essentially so also
for the Moon.
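A minimal numerical sketch of the manda-correction follows. It assumes the relations $K^2 = (R + r\cos M)^2 + r^2\sin^2 M$ and $r/K = r_0/R$, iterates them to a fixed point, and compares the result with the closed form $K = R^2/(\sqrt{R^2 - r_0^2\sin^2 M} - r_0\cos M)$ that these two relations imply. The function names, tolerance and iteration cap are choices made for this example and are not taken from the texts.

```python
import math

R = 21600 / (2 * math.pi)   # trijya: minutes of arc in a radian, about 3438


def manda_hypotenuse(M_deg, r0, tol=1e-12, max_iter=100):
    """Fixed-point iteration for K, with the epicycle radius r = r0 * K / R."""
    M = math.radians(M_deg)
    K = R                                   # first guess: K equal to trijya
    for _ in range(max_iter):
        r = r0 * K / R                      # radius proportional to current K
        K_new = math.hypot(R + r * math.cos(M), r * math.sin(M))
        if abs(K_new - K) < tol * R:
            return K_new
        K = K_new
    return K


def manda_hypotenuse_closed(M_deg, r0):
    """Closed form implied by the same two relations (no iteration)."""
    M = math.radians(M_deg)
    return R * R / (math.sqrt(R * R - (r0 * math.sin(M)) ** 2) - r0 * math.cos(M))


def manda_correction_minutes(M_deg, r0):
    """theta_ms - theta_m in minutes of arc: sin(correction) = -(r0/R) sin M."""
    s = r0 / R * math.sin(math.radians(M_deg))
    return -math.degrees(math.asin(s)) * 60.0
```

With the Moon's ratio $r_0/R = 7/80$ quoted later in the article, `manda_correction_minutes(90, 7 / 80 * R)` is about -301.2 minutes, and the iteration settles in about a dozen steps because each step shrinks the error by roughly the factor $r_0/R$.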

The manda-corrected planet is the true heliocentric longitude for the planets 
Mercury, Venus, Mars, Jupiter and Saturn called ‘taragraha’s, as we will see shortly. 
One more correction, namely the 'sighra'-correction ('sighra-samskara'), has to be
applied to them to obtain their geocentric longitudes. 

When $r_0/R \ll 1$, $\sin(\theta_{ms} - \theta_m) \approx \theta_{ms} - \theta_m$ and (5) reduces to

$$\theta_{ms} - \theta_m = -\frac{r_0}{R}\sin(\theta_m - \varpi) = -\frac{r_0}{R}\sin M. \qquad (6)$$

Comparing this with (35) in Appendix 1, it can be seen that this has the same form
as the equation of centre correct to lowest order in eccentricity in Kepler's model,
with $r_0/2R$ playing the role of eccentricity, e. This equivalence is only to the first order
in e. The equation of centre has different forms in higher orders of e in Kepler's
model and Indian models.
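As a quick numerical illustration, matching the coefficient $r_0/R$ of $\sin M$ in (6) against the coefficient $2e$ in (35), and using the Moon's ratio $r_0/R = 7/80$ quoted later in the article with $R \approx 3438'$, gives the following. The symbol $e_{\mathrm{eff}}$ is introduced here only for this illustration.

```latex
e_{\mathrm{eff}} = \frac{r_0}{2R} = \frac{7}{160} \approx 0.0438,
\qquad
\frac{r_0}{R}\times 3438' = \frac{7}{80}\times 3438' \approx 300.8' .
```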


In Ptolemy's model for the equation of centre, the planet is taken to be moving on
the 'eccentric circle' with O' as the centre, but it would be moving uniformly

Figure 2: Sighra-samskara for exterior planets to obtain the geocentric
longitude.

[...] is a circle with the sighrocca as the centre. The planet's apside (mandocca), U,
is located on this circle. The planet P is located on the eccentric circle which is
centred at U. K = SP is the manda-hypotenuse. $\Gamma\hat{S}P$ is the manda-corrected
planet, which is the true heliocentric longitude. $P\hat{S}B$, which is the difference
between this and the sighrocca, is known as the sighra-anomaly (sighra-kendra). $\Gamma\hat{O}P$
is the 'sighra'-corrected planet ('sighra-sphuta'). The sighrocca, S, is the mean
Sun, and the sighra-epicycle represents its orbit around the earth, O. The manda-corrected
planet is the true heliocentric longitude of the planet around [...]
and the sighra-corrected planet, $\Gamma\hat{O}P$, is the geocentric longitude
with respect to the earth, O. It is in this sense that the [...]

We now employ the following notation:

mean longitude (madhyama-graha) = $\Gamma\hat{U}P = \theta_m$,
manda-corrected planet (manda-sphuta) = $\Gamma\hat{S}P = \theta_{ms}$,
manda-hypotenuse (manda-karna) = $SP = K$,
radius of the manda-epicycle = $US = r = r_0 K / R$,
mean Sun (sighrocca) = $\Gamma\hat{O}S = \theta_s$,
radius of the sighra-epicycle = $OS = r_s$,
sighra-anomaly (sighra-kendra) = $P\hat{S}B = \Gamma\hat{S}P - \Gamma\hat{S}B = \theta_{ms} - \theta_s$,
sighra-hypotenuse (sighra-karna) = $OP = K_s$, (7)
sighra-corrected planet (sighra-sphuta) = $\Gamma\hat{O}P = \theta_t$. (8)

Here the manda-corrected planet, $\theta_{ms}$, and the manda-hypotenuse, K, are determined
in terms of the mean longitude ($\theta_m$), the apside ($\varpi$), and the prescribed value of
$r_0$, as described in the previous sub-section. The following formula for $\theta_t$ can be
easily derived from the geometrical construction in Figure 2:

$$\sin(\theta_t - \theta_{ms}) = \frac{r_s\sin(\theta_s - \theta_{ms})}{K_s}
= \frac{r_s\sin(\theta_s - \theta_{ms})}{\left[(K + r_s\cos(\theta_{ms} - \theta_s))^2 + r_s^2\sin^2(\theta_{ms} - \theta_s)\right]^{1/2}}. \qquad (9)$$


[...] formula in earlier texts also, such as Suryasiddhanta, Mahabhaskariya [...].
The formula [...] in Yukti-bhasa is exactly the same as would follow from
[...] (Appendix 1) for an exterior planet, when $\theta_{ms}$ [...]
true heliocentric longitude in the latter [...]. The identification of [...]
the heliocentric mean planet to which the correction [...]. There is a small error as
[...] is the [...] in Ptolemy's Almagest [2] [...] Copernicus [3]. In Table 2, we [...]

Figure 3: Sighra-samskara for interior planets to obtain the geocentric longitude,
in Aryabhatiya.


Interior Planets 


In both the Indian and Greek traditions, the mean Sun was taken to be the mean 
planet for the interior planets. This is due to the fact that the directions of Mercury 
and Venus are close to that of the Sun, as viewed from the earth. So, their geocentric 
longitudes would not be very different from the longitude of the Sun. The procedure 
for the computation of the geocentric longitude of an interior planet in Aryabhatiya 
and other texts prior to Tantrasangraha, is represented geometrically in Figure 3.
Here S is the mean Sun moving in a circle around U, which is the apside of the
planet, where OU is equal to the radius of the manda-epicycle of the planet. Thus,
S is nothing but the mean Sun, to which the manda-correction pertaining to the
planet has been applied. P is the sighrocca which is actually the mean heliocentric
planet, moving on the sighra-epicycle, which is the planet's orbit around S.^ Then
what is computed as the geocentric longitude of the inner planet in the earlier
texts, is actually $\Gamma\hat{O}P$. This is obviously wrong. The manda-corrected planet $\Gamma\hat{O}S$
here does not correspond to anything physical. The same mistake was committed
by Ptolemy in his Almagest [2] and even by Copernicus [3], who seems to have 
followed Ptolemy closely. 

Nilakantha Somayaji seems to have been the first astronomer who revised this 
picture in Tantrasangraha and his other works [7,8]. His revised formulation is what 
is described in Yukti-bhasa [19] and depicted here in Figure 4. Nilakantha has a


^In Aryabhatiya and other ancient texts, for the interior planets, the sighrocca and the sighra-epicycle
represent entirely different things, when contrasted with the same objects for exterior planets. For the
latter, the sighrocca was the mean Sun and the sighra-epicycle was the mean Sun's orbit around the earth.

[...] the sighrocca, in the same way as for exterior planets. S moves on a circle of radius
[...] the centre of the earth, O, as the centre. This is the sighra-circle (sighra-vrtta
[...] sighra-nicocca-vrtta) for the planet. The planet P moves on an eccentric orbit
[...]. The planet's distance from S, SP, is the manda-hypotenuse with its value
[...] with respect to S. This is the true heliocentric longitude of
[...] also shown the apside (mandocca), U. $UP = r_s$, and
[...] radius of the manda-epicycle. The smaller circle
[...] orbit of P around S, whose radius is $\tilde{r}_s$. $\Gamma\hat{O}P$ is the
[...] which is the geocentric longitude. We now summarise the [...]

Planet's distance from S, reduced manda-hypotenuse = $SP = \tilde{r}_s$ = [...],
Distance between apside and sighrocca = $US$ = [...],
sighra-anomaly (sighra-kendra) = $P\hat{S}B = \Gamma\hat{S}P - \Gamma\hat{S}B = \theta_{ms} - \theta_s$,
sighra-hypotenuse (sighra-karna) = $OP = K_s$,
sighra-corrected planet (sighra-sphuta) = $\Gamma\hat{O}P = \theta_t$. (11)

In the case of the exterior planet, the mean Sun which is the sighrocca moves in
a smaller circle of radius $r_s$ and the planet moves in a circle of radius equal to trijya,
R, around the apside. Here for an interior planet, the mean Sun is the sighrocca,
which moves in the bigger circle of radius R, and the planet moves in an eccentric
orbit, such that it is at a distance $r_s$ from U, and $\tilde{r}_s$ from S. The manda-corrected
planet, $\theta_{ms}$, is calculated from $\theta_m$ in the usual fashion. The sighra-hypotenuse, $K_s$,
is given by

$$K_s = \left[(R + \tilde{r}_s\cos(\theta_{ms} - \theta_s))^2 + \tilde{r}_s^2\sin^2(\theta_{ms} - \theta_s)\right]^{1/2}. \qquad (12)$$

Then the geometry of Figure 4 implies the following formula for the true geocentric
longitude, $\theta_t$:

$$\sin(\theta_t - \theta_s) = \frac{\tilde{r}_s\sin(\theta_{ms} - \theta_s)}{K_s}
= \frac{\tilde{r}_s\sin(\theta_{ms} - \theta_s)}{\left[(R + \tilde{r}_s\cos(\theta_{ms} - \theta_s))^2 + \tilde{r}_s^2\sin^2(\theta_{ms} - \theta_s)\right]^{1/2}}. \qquad (13)$$
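The sighra formulae for exterior and interior planets can both be checked against plain vector addition, since in either case the earth-planet vector OP is the sum of OS (earth to mean Sun) and SP (mean Sun to planet). The sketch below assumes the forms $\sin(\theta_t - \theta_{ms}) = r_s\sin(\theta_s - \theta_{ms})/K_s$ for an exterior planet and $\sin(\theta_t - \theta_s) = \tilde{r}_s\sin(\theta_{ms} - \theta_s)/K_s$ for an interior one. All function names are illustrative, and the numbers in the usage note are hypothetical, not values from the texts.

```python
import math


def sighra_exterior(theta_ms, theta_s, K, r_s):
    """Exterior planet: returns (theta_t, K_s); angles in degrees."""
    d = math.radians(theta_ms - theta_s)            # sighra-anomaly
    K_s = math.hypot(K + r_s * math.cos(d), r_s * math.sin(d))
    # sin(theta_t - theta_ms) = r_s sin(theta_s - theta_ms) / K_s
    theta_t = theta_ms + math.degrees(math.asin(-r_s * math.sin(d) / K_s))
    return theta_t % 360.0, K_s


def sighra_interior(theta_ms, theta_s, r_tilde, R):
    """Interior planet: returns (theta_t, K_s); angles in degrees."""
    d = math.radians(theta_ms - theta_s)
    K_s = math.hypot(R + r_tilde * math.cos(d), r_tilde * math.sin(d))
    # sin(theta_t - theta_s) = r_tilde sin(theta_ms - theta_s) / K_s
    theta_t = theta_s + math.degrees(math.asin(r_tilde * math.sin(d) / K_s))
    return theta_t % 360.0, K_s


def geocentric_by_vectors(theta_s, OS, theta_ms, SP):
    """Direction and length of OP = OS + SP, as an independent check."""
    x = OS * math.cos(math.radians(theta_s)) + SP * math.cos(math.radians(theta_ms))
    y = OS * math.sin(math.radians(theta_s)) + SP * math.sin(math.radians(theta_ms))
    return math.degrees(math.atan2(y, x)) % 360.0, math.hypot(x, y)
```

For instance, with the hypothetical inputs `theta_ms=100`, `theta_s=40`, `K=3500`, `r_s=2000`, `sighra_exterior` and `geocentric_by_vectors(40, 2000, 100, 3500)` return the same longitude and distance. The arcsine branch is safe in both cases because the angle being solved for lies opposite the shorter of the two sides OS and SP.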




Table 2: Comparison of $r_s/R$ in Tantrasangraha for
sighra-samskara with the modern values of the mean ratio
of Earth-Sun and planet-Sun distances for exterior planets
and the inverse ratio in the case of the interior planets.

Planet      Tantrasangraha value
Mercury
Venus
Mars
Jupiter
Saturn


“In the above set up, the sighra-vrtta is presumed with its centre at the centre
of the apakrama circle and its circumference along the marga (in the plane) of
the apakrama circle. It may be recalled that the apakrama circle near the centre is
called sighrocca-nica-vrtta. The size of the sighra-vrtta-s will be different for the
different planets. That is all the difference (between the sighra-vrtta-s) and there is
no difference in their placement as they are located the same way (i.e., with their
centre at the centre of the apakrama circle and also lying in the same plane).

“Now, the manda-nicocca-vrtta is a circle having its centre on the circumference
of the sighra-vrtta at the point where the mean Sun is. This is the case for all
(the planets). The ascending node (pata) has its motion along the circumference
of the manda-nicocca-vrtta in the retrograde manner. The point in the manda-nicocca-vrtta
where the pata is, will touch the apakrama-mandala. One half of the
manda-nicocca-vrtta, commencing from the pata, will lie on the northern side of
the apakrama-mandala. Again, the point which is six signs away from the pata
will touch the apakrama-mandala. The other half (of the manda-nicocca-vrtta)
[...] the southern side of the apakrama-mandala. Here, that point, which is
[...] maximum from the (plane of the) apakrama-mandala, will indicate the
[...] of the planets in terms of the minutes of arc [...]

[...] Hence, the pratimandala too will be inclined
[...] from the plane of [...]-mandala in accordance
[...]. The manda-karna-vrtta will also be inclined accordingly.
[...] manda-hypotenuse.
[...] the Rsine of the manda-sphuta minus pata is multiplied by the maximum viksepa
and divided by trijya, the result will be the ista-viksepa of the planet on the manda-karna-vrtta.
This inclination (deflection from the ecliptic) is called viksepa.”

Figure 5: The latitudinal deflection (viksepa) of a planet.

This has the following implication for the geometrical model for the planets.
In Figure 2 (for the exterior planets) and Figure 4 (for the interior planets), the
mean Sun, S (sighrocca), moves on the sighra-circle in the plane of the ecliptic
(apakrama-mandala), with the earth, O, as the centre. The planet, P, moves around
S in an eccentric circle with the apside, U, as the centre, in a plane which is inclined
to the ecliptic. SU and UP are in this plane. First, let us consider the heliocentric
longitude. In Figure 5, O is the earth and S is the Sun moving around the earth in
the plane of the ecliptic. P is the planet moving in a plane inclined to the ecliptic,
and K = SP, which lies in this plane, is the manda-hypotenuse. The ecliptic and
the orbit of the planet intersect at N, which is the ascending node.

The latitude of the planet is $\beta = PQ$, which is the arc perpendicular to the ecliptic
passing through P. PB is perpendicular to the plane of the ecliptic. Then the
distance of the planet from the plane of the ecliptic, termed 'viksepa', is given by

$$\text{viksepa} = v = PB = K\sin\beta. \qquad (14)$$

$\beta$ itself is found from the following expression:

$$R\sin\beta = R\sin i\,\sin(\theta_{ms} - \theta_N),$$

where $\theta_{ms}$ is the true heliocentric longitude of the planet (manda-corrected planet),
$\theta_N$ is the longitude of the ascending node of the planet, and i is the inclination of
the orbit. Figure 6 shows the relation between the geocentric latitude $\beta_E$ and the
heliocentric latitude $\beta$ as explained in Yukti-bhasa. We have

$$PB = OP\sin\beta_E = SP\sin\beta, \qquad (15)$$

[Figure 6: Geocentric latitude, $\beta_E$. Surviving labels: Sighra-karna in the plane of Viksepa-koti-vrtta.]

or,

$$\sin\beta_E = \frac{SP}{OP}\sin\beta. \qquad (16)$$

When $\beta$ and $\beta_E$ are small, the above relation reduces to

$$\beta_E = \frac{SP}{OP}\,\beta. \qquad (17)$$


OP is the earth-planet distance (bhu-tara-graha-vivara), while SP is the Sun-planet
distance. Equation (17) is exactly the same as the relation (51) in Appendix 1
on Kepler's model, relating the geocentric latitude $\beta_E$ and the heliocentric latitude
$\beta_S$. Here it may be pertinent to recall that the theory of latitude in Ptolemy's Almagest
was totally unsatisfactory, as it took the planes of the orbits to be passing through
the earth rather than the Sun [2]. The same was the problem with De Revolutionibus
of Copernicus also [3].

The latitudinal deflection affects the longitude also. The manda-hypotenuse, 
SP = K, should be projected onto a plane passing through the planet, P which 
is parallel to the ecliptic. $S'P = K\cos\beta$, which is the projection, is termed
viksepa-koti (see Figures 5 and 6). It is pointed out that the geocentric longitude,
Sighra-corrected planet, should be calculated taking the viksepa-koti as the manda- 
hypotenuse. The result is the true planet (graha-sphuta). This is the “reduction to 
the ecliptic” which was taken note of by some of the Islamic astronomers and later 
discussed by Tycho Brahe in late sixteenth century. 
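The latitude relations and the viksepa-koti can be put together in a short sketch. It assumes $R\sin\beta = R\sin i\,\sin(\theta_{ms} - \theta_N)$ for the heliocentric latitude, $\sin\beta_E = (SP/OP)\sin\beta$ for the geocentric latitude, and $K\sin\beta$ and $K\cos\beta$ for the viksepa and viksepa-koti. Function and argument names are illustrative only.

```python
import math


def heliocentric_latitude(theta_ms, theta_node, inclination):
    """beta in degrees, from sin(beta) = sin(i) sin(theta_ms - theta_N)."""
    s = (math.sin(math.radians(inclination))
         * math.sin(math.radians(theta_ms - theta_node)))
    return math.degrees(math.asin(s))


def geocentric_latitude(beta, SP, OP):
    """beta_E in degrees, from sin(beta_E) = (SP / OP) sin(beta)."""
    return math.degrees(math.asin(SP / OP * math.sin(math.radians(beta))))


def viksepa(K, beta):
    """Distance PB of the planet from the plane of the ecliptic."""
    return K * math.sin(math.radians(beta))


def viksepa_koti(K, beta):
    """Projection of the manda-hypotenuse K onto the ecliptic plane."""
    return K * math.cos(math.radians(beta))
```

In this reading, the viksepa-koti would be passed in place of K when the sighra-correction is computed, which is the reduction to the ecliptic described above.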

In summary, we may note the following important points as regards the devel- 
opment of planetary theory in Kerala texts: 


1. The algorithms for the geocentric longitudes of planets given in Aryabhatiya
and other ancient texts more or less simulated the results of the Kepler model,
but for the fact that the equation of centre for interior planets was wrong.

2. Nilakantha's revised formulation of the equation of centre for interior planets
led him to a unified treatment of exterior and interior planets. The
articulation of this revised model is to be found in Yukti-bhasa. The planets
move in eccentric orbits around sighrocca (mean Sun), which itself goes
around the earth. The formulae for the geocentric longitudes and latitudes
of planets here are essentially the same as in the Kepler model.

3. Ptolemy's model for planetary motion is more complicated. Apart from the
error in the formulation of equation of centre of interior planets, the theory
for planetary latitudes is totally off the mark in his Almagest, where Ptolemy
gives different models for exterior and interior planets. He revised his model
for latitudes in his Handy tables, and later in the Planetary hypothesis [22].
His formulation in the later work can be taken to be his last word on the
subject. Here, in the case of the exterior planets, the eccentric (around the
earth) is inclined to the ecliptic. The epicycle which represents the motion of
the Sun is parallel to the ecliptic. The model for the interior planets is more
complicated, but is a vast improvement over the same in Almagest. The final
results for the latitudes in Planetary hypothesis are close to those following
from the modern heliocentric theory, though they are complicated, and are
not equivalent to Kepler's model in any simple manner. In fact, the same
is true with respect to the Copernican model also, revolutionary as it was in
treating the earth also as a planet.

4. It should be mentioned that the geometrical model described in Yukti-bhasa
is for each individual planet separately. It does not give a unified picture
involving all the planets, to a single scale.


5. Lunar Theory in Yukti-bhasa 


Yukti-bhasa describes a geometrical picture [...] the 'second correction' [...]
interesting geometrical picture, according to [...]
is not the same as that of the celestial sphere (bhagola) [...]
position with respect to the earth in such a way that it is always [...]
direction or opposite to it, but at a variable [...]
Moon, calculated using the procedure described earlier, [...]

8 The Handy tables and Planetary hypothesis seem to have been totally [...]
(the theory of latitudes) by all the astronomers in the Greco-European and Islamic [...]
of Kepler [22].

9 It may be noted that the geometrical picture [...]
from the model suggested recently by Duke [24], [...] possible [...]
second lunar correction discussed by Manjulacarya in Laghumanasa.

[Figure residue. Surviving labels: (Mesadi), (Mesadi).]

Figure 7: Geometrical model for the second correction to the Moon
in Yukti-bhasa.

[...] longitude with respect to the centre of the celestial sphere. A second correction
has to be applied to the manda-corrected Moon to obtain the true longitude of the
Moon with respect to the centre of the Earth, termed the 'second-corrected planet'
('dvitiya-sphuta').

The procedure for the second correction is similar to the calculation of the
manda-correction with the centre of the celestial sphere serving as the equivalent
of the apside, and called just 'ucca', which is taken to be in the direction of the Sun.
[...] between this and the centre of the Earth, which is the radius of the
[...] to be a continuously varying quantity given by $\frac{R}{2}\cos(\theta_s - \varpi)$
[...] where $\theta_s$ and $\varpi$ are the longitude of the Sun and the apogee of
[...] respectively. According to the Aryabhatan school, the mean
[...] Moon and the centre of the earth is 10R = 34380 yojanas.
[...] the Moon and the earth will be 10K, where K is the [...]

[...] longitude, $\theta_m$, and the apogee, $\varpi$, are not shown in the figure. $CP = D_1 = 10K$,
where K is the manda-hypotenuse in minutes. It is clear that [...].
OP, the 'second-corrected hypotenuse' ('dvitiya-sphuta-karna') in [...]
the true distance between the Moon and the centre of the Earth. The bhuja-phala,
ON, and koti-phala, CN, are given by

$$ON = r\sin(\theta_{ms} - \theta_s) = \frac{R}{2}\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s), \qquad (19)$$

and

$$CN = r\cos(\theta_{ms} - \theta_s) = \frac{R}{2}\cos(\theta_s - \varpi)\cos(\theta_{ms} - \theta_s). \qquad (20)$$

Then, the true distance is given by

$$OP = D_2 = \sqrt{PN^2 + ON^2} = \sqrt{(PC + CN)^2 + ON^2} = \sqrt{(D_1 + \text{koti-phala})^2 + \text{bhuja-phala}^2}$$

$$= \left[\left(D_1 + \frac{R}{2}\cos(\theta_s - \varpi)\cos(\theta_{ms} - \theta_s)\right)^2 + \left(\frac{R}{2}\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s)\right)^2\right]^{1/2}. \qquad (21)$$

Here

$$D_1 = 10K = \frac{10R^2}{\sqrt{R^2 - r_0^2\sin^2 M} - r_0\cos M}, \qquad (22)$$

where $M = \theta_m - \varpi$ is the Moon's anomaly, and $r_0$ is [...]
epicycle introduced in [...] on the equation of centre [...].

We now explain how [...] uses the above geometrical picture of lunar
motion to incorporate [...] called 'evection' correction to the longitude of the
Moon. In Figure 7, $\Gamma\hat{O}P$ is the true longitude of the Moon with [...]
the centre of the earth. The true position of the Moon at P can also be [...]
drawing $OP' = D_1$ in the direction of the manda-corrected planet with respect [...]
P', in the direction of the Sun.

Now

$$O\hat{P}C = P\hat{O}P' = \Gamma\hat{O}P' - \Gamma\hat{O}P = \theta_{ms} - \theta_t.$$

Hence

$$\sin(\theta_{ms} - \theta_t) = \sin(O\hat{P}C) = \frac{ON}{OP} = \frac{\frac{R}{2}\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s)}{D_2}, \qquad (23)$$

where $D_2$ is given by (21). Hence

$$\theta_t = \theta_{ms} - \sin^{-1}\left[\frac{R}{2}\,\frac{\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s)}{D_2}\right]
= \theta_m - \sin^{-1}\left(\frac{r_0}{R}\sin M\right) - \sin^{-1}\left[\frac{R}{2}\,\frac{\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s)}{D_2}\right]. \qquad (24)$$


Now when $x \ll 1$, $\sin^{-1}x \approx x$ (radians) $= 3438x$ (minutes). $r_0/R$ for the Moon is
specified to be 7/80 in Tantrasangraha. Then to the first order in $r_0/R$ and $r/D_1$, we
obtain the true longitude of the Moon to be

$$\theta_t = \theta_m - 300.82'\sin M - 171.9'\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s), \qquad (25)$$

where $\theta_m$ is the mean longitude of the Moon in minutes, the second term is the
usual equation of centre and the other term is the second correction as prescribed in [...]

[...] where $\varpi_s$ is the longitude of the Sun's apogee. In the RHS, the second and third terms
represent the 'Equation of centre', and the fourth, fifth and the sixth ones are the
'Evection', 'Variation', and 'Annual inequality' respectively. Part of the second term
and the fourth term can be combined using the relation:

$$\sin A + \sin B = 2\sin\left(\frac{A + B}{2}\right)\cos\left(\frac{A - B}{2}\right).$$

Then, writing only the terms corresponding to the two inequalities, namely, 'equation
of centre' and 'evection',

$$\theta_t(\text{Modern}) = \theta_m - 300.9'\sin M + 12.82'\sin(2M) - 152.86'\cos(\theta_s - \varpi)\sin(\theta_{ms} - \theta_s) + \text{Variation} + \text{Annual inequality} + \ldots \qquad (27)$$

We may now compare the expression for $\theta_t$ in Tantrasangraha and Yukti-bhasa to
the first order in $r_0/R$ and $r/D_1$ with the modern expression, and with the one which
follows from the geometrical model of Ibn al-Shatir (outlined in Appendix 3). The
coefficient of the sin M term in the Tantrasangraha/Yukti-bhasa formula (25) has very
nearly the same value as in the modern expression (27). Ibn al-Shatir's value for
the same is 296.05'. The evection term in (25) has the same form as in the modern
expression, though with a higher coefficient of 171.9'. In fact, the Laghumanasa
value of 143'58" seems to be better. In Ibn al-Shatir's theory this has the value
162.35'. It should however be remembered that we have retained only the first
order terms in the Tantrasangraha/Yukti-bhasa expression, as also in Ibn al-Shatir's
theory. There is no sin(2M) term in Tantrasangraha/Yukti-bhasa. This is true even
if we include the higher order terms in $r_0/R$. However the coefficient of this term is
small even in the modern theory.

It is possible to compare the exact values of the correction to the mean longitude
due to equation of centre and evection, without any approximation, at syzygy
($\theta_m - \theta_s = 0°$ or $180°$) or quadrature ($\theta_m - \theta_s = \pm 90°$), when it is also assumed
that the anomaly, M = 90°. At syzygy, the second correction is zero in
Tantrasangraha/Yukti-bhasa, and when the anomaly is 90°, the magnitude of
$\theta_t - \theta_m$ is $\sin^{-1}(7/80)$ = 301.2', compared with the modern value of 300.9',
Ptolemy's value of 301' and Ibn al-Shatir's value of 295'. At quadrature, when the
anomaly is 90°, it is easy to check that $D_1 = 10.038R$ and $D_2 = 10.05R$. Then the
magnitude of $\theta_t - \theta_m$ is $\sin^{-1}(7/80) + \sin^{-1}(R/2D_2)$ = 472.3', compared with
the modern value of 453.8', Ptolemy's value of 460' and Ibn al-Shatir's value of 456'.
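The syzygy and quadrature figures quoted above can be reproduced with a short sketch of the two lunar corrections. It assumes the equation of centre $\sin^{-1}((r_0/R)\sin M)$ with $r_0/R = 7/80$, the distance $D_1 = 10R^2/(\sqrt{R^2 - r_0^2\sin^2 M} - r_0\cos M)$, and the second-correction geometry with $r = (R/2)\cos(\theta_s - \varpi)$. Function names are illustrative, and the quadrature case follows the text in taking $r = R/2$ and $\theta_{ms} - \theta_s = 90°$.

```python
import math

R = 21600 / (2 * math.pi)     # trijya in minutes, about 3438
R0_OVER_R = 7 / 80            # Moon's manda-epicycle ratio quoted in the text


def equation_of_centre_minutes(M_deg):
    """Magnitude-signed arcsin((r0/R) sin M), in minutes of arc."""
    return math.degrees(math.asin(R0_OVER_R * math.sin(math.radians(M_deg)))) * 60.0


def D1(M_deg):
    """Manda-corrected distance 10K (yojanas), closed form for K."""
    M = math.radians(M_deg)
    r0 = R0_OVER_R * R
    return 10.0 * R * R / (math.sqrt(R * R - (r0 * math.sin(M)) ** 2) - r0 * math.cos(M))


def second_correction(theta_ms, theta_s, varpi, M_deg):
    """Return (D2, theta_ms - theta_t in minutes) from the triangle O, C, P."""
    r = R / 2.0 * math.cos(math.radians(theta_s - varpi))
    d = math.radians(theta_ms - theta_s)
    ON = r * math.sin(d)                      # bhuja-phala
    CN = r * math.cos(d)                      # koti-phala
    D2 = math.hypot(D1(M_deg) + CN, ON)
    return D2, math.degrees(math.asin(ON / D2)) * 60.0
```

With `second_correction(90.0, 0.0, 0.0, 90.0)` the distance comes out close to 10.05R and the second correction close to 171.1 minutes; adding `equation_of_centre_minutes(90.0)`, about 301.2 minutes, gives roughly the 472.3 minutes quoted for quadrature.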

Hence, the Tantrasangraha/Yukti-bhasa value for the maximum correction due
to equation of centre and evection is very accurate at syzygies, and reasonable at
quadratures.

Variation in Earth-Moon Distance


From (22) and using the value of $r_0/R = 7/80$, we find that the manda-hypotenuse
in yojanas, $D_1$, has the values 10.9589R, 10.0385R and 9.1954R when M is 0°
(Moon at apogee), 90° and 180° (Moon at perigee), respectively.

At syzygy, $\theta_{ms} - \theta_s = 0°$ or $180°$. Then from (21), $D_2 = D_1 + \frac{R}{2}\cos M$, and it
can be easily seen that the true distance of the Moon from the earth, $D_2$, has the values
11.4589R, 10.0385R and 8.6954R when M is 0° (apogee), 90° and 180° (perigee),
respectively. Hence, at syzygy, the ratio of the maximum and minimum distances
of the Moon from earth is 1.3178 according to Tantrasangraha/Yukti-bhasa. This
ratio is 1.1925 in Ptolemy's theory, and 1.1884 in Ibn al-Shatir's theory.

At quadrature, $\theta_{ms} - \theta_s = \pm 90°$. Then, $D_2 = D_1$ when M = 0° or 180°, and
$D_2 = \sqrt{D_1^2 + R^2/4}$ when M = ±90°. The true lunar distance $D_2$ has the values
10.9589R, 10.0509R and 9.1954R when M is 0° (apogee), 90° and 180° (perigee),
respectively. Hence, at quadrature, the ratio of the maximum and minimum distances
of the Moon from earth is 1.1918 according to Tantrasangraha/Yukti-bhasa. In
Ptolemy's model this ratio turns out to be 1.3112 and in Ibn al-Shatir's model,
1.3077.

Overall, the ratio of the maximum and minimum distances turns out to be
1.3178, 1.9278 and 1.3112 according to Tantrasangraha/Yukti-bhasa, Ptolemy's
model and Ibn al-Shatir's theory respectively.
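The Tantrasangraha/Yukti-bhasa distances above follow from simple arithmetic once $r_0/R = 7/80$ is put into (22). The following worked lines are offered as a check, with the syzygy and quadrature ratios formed from the apogee and perigee values:

```latex
\begin{aligned}
D_1(M = 0^\circ) &= \frac{10R}{1 - 7/80} = \frac{800}{73}R \approx 10.9589R, \\
D_1(M = 180^\circ) &= \frac{10R}{1 + 7/80} = \frac{800}{87}R \approx 9.1954R, \\
\text{syzygy: } \frac{D_{2,\max}}{D_{2,\min}} &= \frac{800/73 + 1/2}{800/87 - 1/2} \approx \frac{11.4589}{8.6954} \approx 1.3178, \\
\text{quadrature: } \frac{D_{2,\max}}{D_{2,\min}} &= \frac{800/73}{800/87} = \frac{87}{73} \approx 1.1918 .
\end{aligned}
```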

Thus it is remarkable that Indian astronomers, from the time of Vatesvara (c.904) 
and Manjulacarya (c.932), arrived at a very accurate expression for the second lunar 
correction and this seems to have been based on a fairly reasonable geometrical 
picture of motion as revealed in the later Kerala works, Tantrasangraha (c.1500)
and Yukti-bhasa (c.1530).

Appendix 1
Geocentric Coordinates of a Planet in Kepler's Model


The planetary models in ancient times can be appreciated better, if we understand
how the geocentric coordinates of a planet are calculated in Kepler's model, which is
correct in its essentials even today [13]. The three laws discovered by Kepler in the


early seventeenth century form the basis of our present understanding of planetary 
orbits. The three laws are: 


1. Each planet moves around the Sun in an ellipse, with Sun at one of the foci. 
2. The areal velocity of a planet in its orbit is a constant. 


3. The square of the time-period of a planet in its orbit is proportional to the 
cube of the semi-major axis of the ellipse in which it moves. 


Kepler’s laws can be derived from Newton’s second law of motion and law of 
gravitation’. If we take the time period of a planet as an input, Kepler’s third law 
does not play any role for an individual orbit. We should note that Kepler’s laws are 
essentially kinematical laws, which do not make any reference to the concepts of 
‘acceleration’ and ‘force’, as we understand them. Even then, they capture the very 
essence of the nature of planetary orbits and can be used to calculate the planetary 
coordinates, once we know the parameters of the ellipse and the initial coordinates. 
The planetary models in ancient astronomy are kinematical and in that sense they 
can be compared with Kepler’s model. In the following, we shall elaborate on the 
computation of the geocentric longitude and latitude of a planet in Kepler's model. 


Conversely, Newton's inverse square law of gravitation can also be arrived at from Kepler's laws!

Figure 8: Elliptic orbit of a planet around the Sun.


Elliptic Orbits and Equation of Centre


The elliptic orbit of a planet (P) around the Sun (S, which is also one of the foci of
the ellipse) is represented in Figure 8. a and b are the semi-major and semi-minor
axes of the ellipse. $\Gamma$ refers to the first point of Aries. $\varpi = \Gamma\hat{S}A$ is the longitude of
the aphelion (A). $\theta_h = \Gamma\hat{S}P$ is the heliocentric longitude of the planet.
Here e is the eccentricity of the ellipse and $l = a(1 - e^2)$.

Then the equation of the ellipse (in polar coordinates, with the origin at one of
the foci), may be written as [...]
Therefore, [...]

Using the expression for $r^2$ in (30), we find

$$l^2\dot{\theta}_h\left[1 + 2e\cos(\theta_h - \varpi)\right] = \omega ab + O(e^2). \qquad (32)$$

Now $l = a(1 - e^2) = a + O(e^2)$ and $ab = a^2 + O(e^2)$. Hence,

$$\dot{\theta}_h\left[1 + 2e\cos(\theta_h - \varpi)\right] \approx \omega, \qquad (33)$$

where the equation is correct to O(e). Integrating with respect to time, we obtain

$$\theta_h + 2e\sin(\theta_h - \varpi) \approx \omega t, \qquad (34)$$

or again,

$$\theta_h - \theta_m = \theta_h - \omega t = -2e\sin(\theta_m - \varpi) + O(e^2). \qquad (35)$$

Here $\theta_m = \omega t$ is the mean longitude which increases linearly with time, t. $\omega t - \varpi$,
the difference between the longitudes of the mean planet and the apogee, is the
'anomaly'. Eq. (35) gives the equation of centre which is the difference between the
true heliocentric longitude $\theta_h$ and the mean longitude $\theta_m$, correct to O(e), in terms
of the anomaly. Clearly, the equation of centre arises due to the eccentricity of the
orbit.
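The first-order result $-2e\sin(\theta_m - \varpi)$ can be compared with an exact solution of Kepler's equation. The sketch below is not part of the original derivation: it solves Kepler's equation by Newton iteration (a standard modern method), takes the anomaly as measured from the aphelion as in this appendix, and uses illustrative function names.

```python
import math


def equation_of_centre_exact(M_deg, e, tol=1e-14):
    """True minus mean longitude in degrees; M_deg is measured from aphelion."""
    M = math.radians(M_deg) + math.pi            # mean anomaly from perihelion
    E = M                                        # starting guess, fine for small e
    for _ in range(50):                          # Newton steps on E - e sin E = M
        dE = (E - e * math.sin(E) - M) / (1.0 - e * math.cos(E))
        E -= dE
        if abs(dE) < tol:
            break
    nu = 2.0 * math.atan2(math.sqrt(1.0 + e) * math.sin(E / 2.0),
                          math.sqrt(1.0 - e) * math.cos(E / 2.0))
    diff = nu - M
    # wrap the difference into [-pi, pi) before converting to degrees
    return math.degrees((diff + math.pi) % (2.0 * math.pi) - math.pi)


def equation_of_centre_first_order(M_deg, e):
    """The O(e) term, -2e sin(M), in degrees."""
    return math.degrees(-2.0 * e * math.sin(math.radians(M_deg)))
```

For a small eccentricity the two functions differ only at order $e^2$, which is the sense in which the comparison with the manda-correction holds.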


The orbits of the planets are inclined at small angles to the plane of Earth's orbit, known as the ecliptic. We will ignore these inclinations and assume that the planetary orbits lie on the plane of the ecliptic while calculating the planetary longitudes, as the corrections introduced by them (inclinations) are known to be small. We consider the longitude of an exterior planet like Mars, Jupiter or Saturn, and that of an interior planet, Mercury or Venus, separately.

[...] planet (P) and that of the Earth (E) around the [...] $\theta_h = \Gamma\hat{S}P$ is the true heliocentric longitude [...] are the true geocentric longitudes of the [...]

Figure 9: Heliocentric and geocentric longitudes of an exterior planet in Kepler's model.

Figure 10: Heliocentric and geocentric longitudes of an interior planet in Kepler's model.


and that of the Earth (E) around the 


Figure 11: Heliocentric and geocentric latitudes of a planet in Kepler’s model. 


Also,

$$\frac{\sin(S\hat{E}P)}{SP} = \frac{\sin(E\hat{S}P)}{EP}. \qquad (45)$$

Using (42)-(45) we get,

$$\sin(\theta_g - \theta_S) = \frac{r\sin(\theta_h - \theta_S)}{[(R + r\cos(\theta_h - \theta_S))^2 + r^2\sin^2(\theta_h - \theta_S)]^{1/2}}. \qquad (46)$$

The difference $(\theta_g - \theta_S)$ is determined from this equation. Adding this to $\theta_S$, we get the true geocentric longitude, $\theta_g$, of the planet. Note that the true longitude of the Sun, $\theta_S$, and the true heliocentric longitude of the planet, $\theta_h$ (obtained by adding the equation of centre for the planet to the mean heliocentric longitude), should be obtained first. Then the true geocentric longitude $\theta_g$ can be obtained using (46).
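The use of (46) can be illustrated with a short numerical sketch. It assumes, as the form of (46) suggests, that $R$ is the Earth-Sun distance, $r$ the Sun-planet distance with $r < R$ (an interior planet), and that all orbits lie in the ecliptic. The function names and the value $r/R = 0.723$ (roughly that of Venus) are illustrative assumptions, not taken from the text.

```python
import math


def elongation_from_eq46(theta_h, theta_s, R, r):
    """theta_g - theta_S (radians) from equation (46); assumes r < R."""
    d = theta_h - theta_s
    earth_planet = math.hypot(R + r * math.cos(d), r * math.sin(d))
    return math.asin(r * math.sin(d) / earth_planet)


def geocentric_longitude(theta_h, theta_s, R, r):
    """True geocentric longitude: theta_S plus the difference from (46)."""
    return (theta_s + elongation_from_eq46(theta_h, theta_s, R, r)) % (2.0 * math.pi)


def geocentric_longitude_by_vectors(theta_h, theta_s, R, r):
    """Independent check: add the vectors Earth->Sun and Sun->planet."""
    x = R * math.cos(theta_s) + r * math.cos(theta_h)
    y = R * math.sin(theta_s) + r * math.sin(theta_h)
    return math.atan2(y, x) % (2.0 * math.pi)


if __name__ == "__main__":
    R, r = 1.0, 0.723  # illustrative distances in units of the Earth-Sun distance
    theta_h, theta_s = math.radians(200.0), math.radians(130.0)
    print(math.degrees(geocentric_longitude(theta_h, theta_s, R, r)))
    print(math.degrees(geocentric_longitude_by_vectors(theta_h, theta_s, R, r)))
```

For $r < R$ the quantity $R + r\cos(\theta_h - \theta_S)$ is always positive, so the arcsine in (46) picks the correct branch and the two functions agree.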
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Figure 12: Ptolemy's model of lunar motion. 


Bs ~ in sin(Ap — Ay), (49) 


1 Bs are small Note that 4 p stands for the true heliocentric longitude of the 
eh so shown Earth's orbit in the figure. The geocentric latitude Jg is 
eB cpi n 


Be = PEP = —, (50) 
ee Se EP 
We SE 


O is the centre of the earth and C is the centre of the circle with radius $R$, at a distance $r_2$ from O in the direction OA to be specified shortly. B is the mean Moon, whose geocentric longitude $\theta_M = \Gamma\hat{O}B$ increases uniformly with respect to O and not C, though it is at a constant distance from C. The direction of OA is chosen such that $A\hat{O}B = 2(\theta_M - \theta_S)$, where $\theta_S$ is the mean longitude of the Sun. Draw a circle of radius $r$ with B as the centre. This is the epicycle corresponding to the equation of centre. The true Moon P is located on the epicycle such that BP is in the direction of Moon's apogee. Then the true longitude of the Moon is $\theta_t = \Gamma\hat{O}P$. This is a combined 'eccentric-epicycle' model. Extend OB to OQ, such that PQ is perpendicular to OQ. Then $P\hat{B}Q = \theta_M - \varpi = M$ is the anomaly, where $\varpi$ is the longitude of Moon's apogee.

The difference between the true and mean longitudes of the Moon, $\theta_M - \theta_t = \Gamma\hat{O}B - \Gamma\hat{O}P = P\hat{O}B$, satisfies the relation:

$$OP\,\sin(\theta_M - \theta_t) = PQ = r\sin M. \qquad (52)$$

Now

$$OP = [(OB + r\cos M)^2 + r^2\sin^2 M]^{1/2}. \qquad (53)$$

Here OB is to be determined. Draw CN perpendicular to OB. As $C\hat{O}N = 2(\theta_M - \theta_S)$, we have $CN = r_2\sin 2(\theta_M - \theta_S)$ and $ON = r_2\cos 2(\theta_M - \theta_S)$. Noting that $NB = \sqrt{CB^2 - CN^2}$, with $CB = R$, we have

$$OB = NB + ON = \sqrt{R^2 - r_2^2\sin^2(2(\theta_M - \theta_S))} + r_2\cos(2(\theta_M - \theta_S)). \qquad (54)$$

Then the correction to the mean longitude, $\theta_t - \theta_M$, is to be determined from

$$\sin(\theta_M - \theta_t) = \frac{r\sin M}{[(OB + r\cos M)^2 + r^2\sin^2 M]^{1/2}}, \qquad (55)$$

where OB is given by (54).
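Equations (54) and (55) translate directly into a short computation. The sketch below is only an illustration: the function names are hypothetical, and the parameter values $r/R = 0.1063$ and $r_2/R = 0.2106$ are the ones the text derives for Ptolemy's model.

```python
import math


def ptolemy_OB(elongation, R, r2):
    """Distance OB from (54); elongation = theta_M - theta_S in radians."""
    two_eta = 2.0 * elongation
    return math.sqrt(R * R - (r2 * math.sin(two_eta)) ** 2) + r2 * math.cos(two_eta)


def ptolemy_correction(M, elongation, R, r, r2):
    """theta_M - theta_t (radians) from (55), for anomaly M in radians."""
    OB = ptolemy_OB(elongation, R, r2)
    OP = math.hypot(OB + r * math.cos(M), r * math.sin(M))  # equation (53)
    return math.asin(r * math.sin(M) / OP)


if __name__ == "__main__":
    R, r, r2 = 1.0, 0.1063, 0.2106
    for name, elong in (("syzygy", 0.0), ("quadrature", math.pi / 2.0)):
        corr = ptolemy_correction(math.pi / 2.0, elong, R, r, r2)
        print(name, "correction at M = 90 deg, in arcminutes:",
              60.0 * math.degrees(corr))
```

Evaluating the correction at $M = 90°$ for elongations of 0° and 90° gives the syzygy and quadrature cases discussed in the text.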

The above equation (55) incorporates both the 'equation of centre' and 'evection' terms. The expression for $\theta_t - \theta_M$ does not reduce to the modern form (27) for the sum of these terms, in any reasonable approximation. However at syzygy, the correction has the same form as the $\sin M$ term in the equation of centre with $OB = R + r_2$ replacing $R$. The second correction would be zero at quadrature also, when $M = 0°$ or $180°$ (at perigee or apogee of the epicycle), just as in the modern theory. The second correction has the maximum value when $M = 90°$.

Ptolemy gives the maximum value of the total correction to be 301′ at syzygy, compared to the modern value of 300.9′. At quadrature the maximum is given to be 460′ compared to the modern value of 453.76′. At other values of $\theta_M - \theta_S$ and anomaly, Ptolemy's theory would not be impressive, as far as the correction to the mean longitude is concerned.

The values of $r/R$ and $r_2/R$ can be calculated from the maximum corrections at the syzygy and quadrature. From (54) and (55), it can be easily checked that

$$\text{at syzygy,}\quad \tan(\theta_M - \theta_t)\big|_{\text{maximum}} = \frac{r}{R + r_2}, \qquad (56)$$

$$\text{and at quadrature,}\quad \tan(\theta_M - \theta_t)\big|_{\text{maximum}} = \frac{r}{R - r_2}. \qquad (57)$$

From the given values of the maximum corrections, we find that $r/R = 0.1063$ and $r_2/R = 0.2106$ in Ptolemy's model.
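The two parameters follow from (56) and (57) by elementary algebra. A minimal sketch, taking the maxima of 301′ and 460′ quoted above as inputs and setting $R = 1$ (the function name is hypothetical):

```python
import math


def ptolemy_parameters(max_syzygy_arcmin, max_quadrature_arcmin):
    """Solve (56) and (57) for r/R and r2/R, in units where R = 1."""
    t_s = math.tan(math.radians(max_syzygy_arcmin / 60.0))      # r / (1 + r2)
    t_q = math.tan(math.radians(max_quadrature_arcmin / 60.0))  # r / (1 - r2)
    # Dividing (56) by (57) gives (1 - r2) / (1 + r2) = t_s / t_q.
    r2 = (t_q - t_s) / (t_q + t_s)
    r = t_s * (1.0 + r2)
    return r, r2


if __name__ == "__main__":
    r, r2 = ptolemy_parameters(301.0, 460.0)
    print("r/R =", round(r, 4), " r2/R =", round(r2, 4))
```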


Variation in Earth-Moon Distance 


Now at syzygy, $OB = R + r_2$. Then $OP = R + r_2 + r$ when $M = 0°$ (apogee of the epicycle), and $OP = R + r_2 - r$ when $M = 180°$ (perigee of the epicycle). Similarly, at quadrature, $OB = R - r_2$. Then $OP = R - r_2 + r$ when $M = 0°$ (apogee of the epicycle), and $OP = R - r_2 - r$ when $M = 180°$ (perigee of the epicycle). Using the values of $r/R$ and $r_2/R$ above, we find that the ratio of the maximum and minimum distances of the Moon from earth is 1.1925 at syzygy, 1.3112 at quadrature, and 1.9278 overall. This is of course absurd. Hence, Ptolemy's model can hardly be taken to be a serious physical model. This is one of the reasons for the search for an alternate lunar theory in the later Islamic tradition of astronomy, which otherwise followed the Ptolemaic framework.
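The three ratios quoted above follow from the extreme values of OP. A minimal sketch with $R = 1$ and the parameter values stated in the text (the function name is hypothetical):

```python
def ptolemy_distance_ratios(r, r2, R=1.0):
    """Ratios of maximum to minimum Earth-Moon distance in Ptolemy's model.

    Returns (ratio at syzygy, ratio at quadrature, overall ratio).
    """
    syzygy_max, syzygy_min = R + r2 + r, R + r2 - r
    quadrature_max, quadrature_min = R - r2 + r, R - r2 - r
    # The overall extremes are the syzygy maximum and the quadrature minimum.
    return (
        syzygy_max / syzygy_min,
        quadrature_max / quadrature_min,
        syzygy_max / quadrature_min,
    )


if __name__ == "__main__":
    print(ptolemy_distance_ratios(0.1063, 0.2106))
```

The overall ratio combines the largest distance (syzygy, apogee of the epicycle) with the smallest (quadrature, perigee of the epicycle), which is why it is so much larger than either of the other two.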


Appendix 3 
The Lunar Theory of Ibn al-Shatir 


Ibn al-Shatir (1304-1375/6 CE) developed a lunar theory in his Kitab Nihayat as-Sul fi Tashih al-Usul, where he dispenses with the eccentric deferent in Ptolemy's theory and introduces instead a second epicycle [10]. His model is illustrated in Figure 13. Here O is the centre of the earth. The mean Moon moves on a deferent circle of radius $R$. Consider the case when the mean Moon, mean Sun and Moon's apogee are in the same direction. Here the mean Moon is at A ($OA = R$). Draw a circle of radius $a$ around A, which is the epicycle corresponding to the 'first inequality'. $A_1$ is the apogee of the epicycle. Draw a second epicycle of radius $b$ around $A_1$. Then the true Moon is at B, which is the perigee of the second epicycle.

Consider the case when the mean Moon is at $P_0$, and the anomaly, $A\hat{O}P_0 = M$. Draw $P_0A_2 = a$ in the direction of Moon's apogee. Draw $A_2P = b$, such that $P_0\hat{A}_2P = 2(\theta_M - \theta_S)$, that is, the radius of the second epicycle rotates at twice the rate of elongation of the mean Moon from the mean Sun, eastwards (counterclockwise) from the direction of the apogee. Then the true Moon is located at P. Draw TQ through P and $P_0Q_1$ perpendicular to OA and $P_0A_2$. Let $\phi$ be the angle between the directions of the true Moon (OP) and the apogee (OA). Let $\theta_t$ be Moon's true longitude, and $\delta\theta = \theta_t - \theta_M$ be the total inequality. Then the 'total inequality' $\delta\theta$ can be expressed as

$$\delta\theta = \theta_t - \theta_M = -P\hat{O}P_0 = \phi - M. \qquad (58)$$

Figure 13: Ibn al-Shatir's model for the two corrections to the Moon.


We derive the expression for the total inequality in the above geometrical model, and simplify it when only terms which are first order in $a/R$ and $b/R$ are retained. This will facilitate comparison with the modern expression for the lunar inequalities given by (26) and (27). Now,

$$TQ = P_0Q_1 = R\sin M, \qquad TP = A_2P\,\sin(2(\theta_M - \theta_S)) = b\sin(2(\theta_M - \theta_S)). \qquad (59)$$

Hence,

$$PQ = TQ - TP = R\sin M - b\sin(2(\theta_M - \theta_S)), \qquad (60)$$

$$A_2T = A_2P\,\cos(2(\theta_M - \theta_S)) = b\cos(2(\theta_M - \theta_S)),$$

$$QQ_1 = P_0T = P_0A_2 - A_2T = a - b\cos(2(\theta_M - \theta_S)),$$

$$OQ_1 = R\cos M,$$

$$OQ = OQ_1 + Q_1Q = R\cos M + a - b\cos(2(\theta_M - \theta_S)). \qquad (61)$$

Hence

$$\tan\phi = \frac{PQ}{OQ} = \frac{R\sin M - b\sin(2(\theta_M - \theta_S))}{R\cos M + a - b\cos(2(\theta_M - \theta_S))}. \qquad (62)$$

Then $\delta\theta$ is found from (58), where $\phi$ is computed as above.
To the first order in $a/R$ and $b/R$,


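$$\tan\phi \approx \left[\sin M - \frac{b}{R}\sin(2(\theta_M - \theta_S))\right]\frac{1}{\cos M}\left[1 - \frac{a - b\cos(2(\theta_M - \theta_S))}{R\cos M}\right]$$

$$\approx \tan M - \frac{1}{R\cos^2 M}\,\big[a\sin M - b\sin M\cos(2(\theta_M - \theta_S)) + b\cos M\sin(2(\theta_M - \theta_S))\big]$$

$$= \tan M - \frac{1}{\cos^2 M}\left[\frac{a}{R}\sin M + \frac{b}{R}\sin(2(\theta_M - \theta_S) - M)\right]. \qquad (63)$$

When the total correction is small,

$$\delta\theta = \phi - M \approx \tan(\phi - M) \approx (\tan\phi - \tan M)\cos^2 M.$$

Hence,

$$\delta\theta = -\left[\frac{a}{R}\sin M + \frac{b}{R}\sin(2(\theta_M - \theta_S) - M)\right] = -\left[\frac{a - b}{R}\sin M + \frac{2b}{R}\sin(\theta_M - \theta_S)\cos(\theta_S - \varpi)\right], \qquad (64)$$

where $\varpi$ is the longitude of Moon's apogee ($M = \theta_M - \varpi$).

[...] above with the modern expression (27) for the lunar inequalities, [...] $\sin(2M)$ term in Ibn al-Shatir's theory. $a/R$ and $b/R$ are [...]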
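The quality of the first-order approximation can be checked numerically against the exact construction of (58) and (62). In the sketch below the function names are hypothetical, and the values $a/R = 0.1097$ and $b/R = 0.0236$ are assumptions chosen to be consistent with the distance ratios quoted in the text for this model; `eta` stands for the elongation $\theta_M - \theta_S$.

```python
import math


def total_inequality_exact(M, eta, a, b, R=1.0):
    """delta-theta = phi - M, with tan(phi) = PQ/OQ as in (60)-(62)."""
    PQ = R * math.sin(M) - b * math.sin(2.0 * eta)
    OQ = R * math.cos(M) + a - b * math.cos(2.0 * eta)
    diff = math.atan2(PQ, OQ) - M
    return math.atan2(math.sin(diff), math.cos(diff))  # wrap to (-pi, pi]


def total_inequality_first_order(M, eta, a, b, R=1.0):
    """First form of the first-order result: two sine terms."""
    return -((a / R) * math.sin(M) + (b / R) * math.sin(2.0 * eta - M))


def total_inequality_product_form(M, eta, a, b, R=1.0):
    """Second form; note that theta_S - varpi = M - eta."""
    return -(((a - b) / R) * math.sin(M)
             + (2.0 * b / R) * math.sin(eta) * math.cos(M - eta))


if __name__ == "__main__":
    a, b = 0.1097, 0.0236  # assumed illustrative values, in units of R
    for M_deg, eta_deg in ((90, 0), (90, 90), (45, 30)):
        M, eta = math.radians(M_deg), math.radians(eta_deg)
        print(M_deg, eta_deg,
              math.degrees(total_inequality_exact(M, eta, a, b)),
              math.degrees(total_inequality_first_order(M, eta, a, b)))
```

The two first-order forms are algebraically identical, and they differ from the exact value only by terms of second order in $a/R$ and $b/R$.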
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Variation of Earth-Moon Distance 


From Figure 13, the distance of the Moon from earth, $D$, is given by

$$D^2 = PQ^2 + OQ^2. \qquad (66)$$

Using (60) and (61), we have

$$D^2 = R^2 + a^2 + b^2 + 2aR\cos M - 2bR\cos(2(\theta_M - \theta_S) - M) - 2ab\cos(2(\theta_M - \theta_S)). \qquad (67)$$

From this, it is easy to show that the maximum and minimum distances at syzygy are $R + (a - b)$ and $R - (a - b)$ respectively. Then, with the given values of $a$ and $b$, the ratio of the maximum and minimum distances at syzygy is found to be 1.1884. Similarly, at quadrature, the maximum and minimum distances are $R + (a + b)$ and $R - (a + b)$. The ratio of the maximum and minimum distances at quadrature is then 1.3077. This is also the ratio of the maximum and minimum distances overall. Clearly, Ibn al-Shatir does manage to modify the Ptolemaic model so as to get rid of the totally unreasonable variation in the Earth-Moon distance, which was a longstanding problem in the development of lunar theory in the Greek and Islamic traditions which followed the Ptolemaic framework.
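The statements about the extreme distances can be verified by scanning (67) over the anomaly and the elongation. The sketch below takes the two quoted ratios, 1.1884 and 1.3077, as inputs and back-solves $a/R$ and $b/R$ from them; the function names are hypothetical and $R$ is set to 1.

```python
import math


def shatir_parameters_from_ratios(ratio_syzygy, ratio_quadrature):
    """Back-solve a/R and b/R from the two distance ratios (R = 1).

    Uses (1 + d) / (1 - d) = ratio, with d = a - b at syzygy
    and d = a + b at quadrature.
    """
    a_minus_b = (ratio_syzygy - 1.0) / (ratio_syzygy + 1.0)
    a_plus_b = (ratio_quadrature - 1.0) / (ratio_quadrature + 1.0)
    return (a_plus_b + a_minus_b) / 2.0, (a_plus_b - a_minus_b) / 2.0


def moon_distance(M, eta, a, b, R=1.0):
    """Earth-Moon distance D from (67); eta = theta_M - theta_S."""
    d_squared = (R * R + a * a + b * b
                 + 2.0 * a * R * math.cos(M)
                 - 2.0 * b * R * math.cos(2.0 * eta - M)
                 - 2.0 * a * b * math.cos(2.0 * eta))
    return math.sqrt(d_squared)


if __name__ == "__main__":
    a, b = shatir_parameters_from_ratios(1.1884, 1.3077)
    print("a/R =", round(a, 4), " b/R =", round(b, 4))
    angles = [math.radians(15 * k) for k in range(24)]
    everywhere = [moon_distance(M, eta, a, b) for M in angles for eta in angles]
    print("overall max/min ratio:", max(everywhere) / min(everywhere))
```

Because the overall extremes occur at quadrature, the overall ratio coincides with the quadrature ratio, in contrast to Ptolemy's model.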
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(from Brahmagupta (c.628 CE) to John Wallis (1685 CE)): Kuttaka, 
Bhavana and Cakravala (the techniques developed by Indian 
mathematicians for the solution of indeterminate equations); 
Development of calculus in India (covering the millennium long history 
of discoveries culminating in the work of the Kerala School giving a 
complete analysis of the basic calculus of polynomial and trigonometric 
functions); Recursive methods in Indian Mathematics (going back to 
Panini's grammar and culminating in the recursive proofs found in the 
Malayalam Text Yuktibhasa (1530 CE)); and Planetary and Lunar models 
developed by the Kerala School of Astronomy. These articles in their 
totality cover a substantial portion of the History of Indian Mathematics 
and Astronomy. 


This book would serve the dual purpose of bringing to the international 
community a better perspective of the mathematical heritage of India, and 
at the same time convey the message that much work yet remains to be 
done, namely the study of many unexplored manuscripts still available in 
libraries in India and abroad. 
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